Skip to content

Instantly share code, notes, and snippets.

@jaehyeon-kim
Created March 16, 2023 20:40
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save jaehyeon-kim/cd6f7617e5399d6ec95f8288f96377ec to your computer and use it in GitHub Desktop.
Save jaehyeon-kim/cd6f7617e5399d6ec95f8288f96377ec to your computer and use it in GitHub Desktop.

1️⃣ The FinnHub Streaming Data Pipeline

2️⃣ Streamify

  • https://github.com/ankurchavda/streamify
  • 💬 The project will stream events generated from a fake music streaming service (like Spotify) and create a data pipeline that consumes the real-time data
  • 💻 Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP

3️⃣ Reddit ETL Pipeline

4️⃣ Audiophile End-To-End ELT Pipeline

5️⃣ Surfline Dashboard

  • https://github.com/andrem8/surf_dash
  • 💬 The pipeline collects data from the surfline API and exports a csv file to S3. Then the most recent file in S3 is downloaded to be ingested into the Postgres datawarehouse. At the end, you obtain a beautiful dashboard showing the data
  • 💻 AWS S3, Airflow, Pandas, Postgres, Ploty
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment