Skip to content

Instantly share code, notes, and snippets.

@eponkratova
Last active March 13, 2022 14:21
Show Gist options
  • Save eponkratova/efc751f3ced161ad42e0dfe86f0a2f8f to your computer and use it in GitHub Desktop.
Save eponkratova/efc751f3ced161ad42e0dfe86f0a2f8f to your computer and use it in GitHub Desktop.
AWS Glue Studio AWS Glue DataBrew
Source -S3 -AWS Glue Data catalog (S3, RDS, Redshift, etc.) -Streaming (AWS Kinesis Data Streams, Kafka) -Manual upload -Direct connection using JDBC -AWS Glue Data catalog (S3, Redshift, RDS) -Amazon Appflow -AWS Data Exchange -Snowflake
Algorithm No information but as per https://www.acf.hhs.gov/sites/default/files/documents/opre/opre-understanding_effect_opioid_epidemic_child_maltreatment-jan2022.pdf, k-mean clustering is used No information
Target -S3 -AWS Glue Data catalog -Connector -S3 -AWS Glue Data catalog -JDBC databases
Pricing AWS Glue ($0.44 per DPU-hour) + crawlers ($0.44 per DPU-hour) + data catalog ($1 per 100,000 objects) $0.48 per DataBrew node hour
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment