AWS Glue is a serverless ETL (data integration) service. AWS describes it as follows:
With AWS Glue, you can discover and connect to more than 70 diverse data sources and manage your data in a centralized data catalog. You can visually create, run, and monitor extract, transform, and load (ETL) pipelines to load data into your data lakes. Also, you can immediately search and query cataloged data using Amazon Athena, Amazon EMR, and Amazon Redshift Spectrum.
Because our nonpartitioned data is already in S3, we can configure Glue to read it directly from the bucket, using a schema we define up front.
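As a rough illustration of what a predefined schema over an S3 location can look like, here is a minimal sketch that builds the table definition accepted by the Glue Data Catalog's `create_table` call in boto3. The bucket, database, table, and column names are all hypothetical, and the CSV format is an assumption, since the actual file format of our data may differ.

```python
from typing import Dict, List, Tuple


def build_table_input(name: str, s3_location: str,
                      columns: List[Tuple[str, str]]) -> Dict:
    """Build a TableInput payload for the Glue CreateTable API.

    Assumes comma-delimited text files; swap the formats and SerDe
    for Parquet, JSON, etc. if the data is stored differently.
    """
    return {
        "Name": name,
        "TableType": "EXTERNAL_TABLE",
        "Parameters": {"classification": "csv"},
        "StorageDescriptor": {
            # The predefined schema: explicit column names and Hive types.
            "Columns": [{"Name": n, "Type": t} for n, t in columns],
            # Glue reads the files directly from this S3 prefix.
            "Location": s3_location,
            "InputFormat": "org.apache.hadoop.mapred.TextInputFormat",
            "OutputFormat": "org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat",
            "SerdeInfo": {
                "SerializationLibrary": "org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe",
                "Parameters": {"field.delim": ","},
            },
        },
    }


def register_table(database: str, table_input: Dict) -> None:
    """Register the table in the Glue Data Catalog (needs AWS credentials)."""
    import boto3  # imported lazily so the payload builder runs without AWS access

    glue = boto3.client("glue")
    glue.create_table(DatabaseName=database, TableInput=table_input)


# Hypothetical names, for illustration only.
table_input = build_table_input(
    name="events_raw",
    s3_location="s3://example-bucket/raw/events/",
    columns=[("event_id", "string"), ("event_time", "timestamp"), ("amount", "double")],
)
# register_table("example_db", table_input)  # uncomment to create the table
```

Keeping the payload construction separate from the API call makes the schema easy to review and test before anything is created in the catalog.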
We can then use AWS Glue to repartition the data: