@sahlone
sahlone / ALT Click And Conversion S3 ETL.md
Last active June 19, 2018 13:29
Click/Conversions ETL to S3 and Athena

This gist describes how Click/Conversion data is loaded from the Tracker into S3 and Athena for use by business processes.

The process works as follows:

  1. The Tracker receives click/conversion data from outside sources and pushes it to a Kafka topic.

  2. The Matcher reads the Kafka topic produced by the Tracker and matches clicks to conversions to produce the matched conversion data.

  3. The Job consumes the Kafka topic produced by the Matcher and uploads the data to S3. Once the data is in S3, we can define a schema over it in Athena and run SQL queries on top of the data.
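
Steps 2 and 3 can be sketched roughly as below. This is only an illustration, not the actual Matcher or Job code: the field names (`click_id`, `ts`, `revenue`), the `matched` prefix, the date-partitioned key layout, and the newline-delimited JSON format are all assumptions, and Kafka consumption and the S3 upload itself are left out so the example stays self-contained.

```python
import json
from datetime import datetime, timezone


def match_conversions(clicks, conversions):
    """Join each conversion to its click on a hypothetical click_id field.

    Conversions without a known click are dropped in this sketch.
    """
    clicks_by_id = {click["click_id"]: click for click in clicks}
    matched = []
    for conversion in conversions:
        click = clicks_by_id.get(conversion["click_id"])
        if click is None:
            continue
        matched.append({
            "click_id": click["click_id"],
            "click_ts": click["ts"],
            "conversion_ts": conversion["ts"],
            "revenue": conversion.get("revenue"),
        })
    return matched


def to_json_lines(records):
    """Serialize records as one JSON object per line (assumed file format)."""
    return "".join(json.dumps(record, sort_keys=True) + "\n" for record in records)


def s3_key(prefix, ts_epoch, batch_id):
    """Build a date-partitioned object key (assumed layout: dt=YYYY-MM-DD)."""
    day = datetime.fromtimestamp(ts_epoch, tz=timezone.utc)
    return f"{prefix}/dt={day:%Y-%m-%d}/batch-{batch_id}.json"


# Hypothetical sample data standing in for messages read from Kafka.
clicks = [{"click_id": "c1", "ts": 1529400000}]
conversions = [
    {"click_id": "c1", "ts": 1529403600, "revenue": 2.5},
    {"click_id": "c9", "ts": 1529403700, "revenue": 1.0},  # no matching click
]

matched = match_conversions(clicks, conversions)
body = to_json_lines(matched)
key = s3_key("matched", matched[0]["conversion_ts"], batch_id=1)
```

A `dt=...` prefix per day is one common choice because Athena can treat such prefixes as partitions, which limits how much data a query scans; whether the real Job lays out its objects this way is not stated here.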

System Diagram