Skip to content

Instantly share code, notes, and snippets.

@abayomi185
Created September 23, 2022 11:08
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save abayomi185/2a81ec290f09bd5aca1da5e64c061459 to your computer and use it in GitHub Desktop.
Save abayomi185/2a81ec290f09bd5aca1da5e64c061459 to your computer and use it in GitHub Desktop.

Query and analyze data in S3 using standard SQL.

Integrates with AWS Glue Data Catalog to crawl data in S3 and create schemas and tables. This process can be automatic or manual.

PrestoDB - open source SQL query engine with support for multiple relational and non-relational databases.

Examples

Screenshot 2022-09-22 at 13 22 46

Creating a table with a schema manually for a dataset. This can also be done automatically with AWS Glue crawlers

Screenshot 2022-09-22 at 23 00 46

Querying the data in S3 using SQL in the query editor.

Note:

  1. Dissimilar datasets in S3 need to be organised into folders for query to show data based on the crawled table and schema
  2. See AWS Athena FAQ for more info
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment