Skip to content

Instantly share code, notes, and snippets.

@evandiewald
Created February 10, 2022 15:00
Show Gist options
  • Save evandiewald/992b0611812af1045c010f62adf099a5 to your computer and use it in GitHub Desktop.
Save evandiewald/992b0611812af1045c010f62adf099a5 to your computer and use it in GitHub Desktop.
DeWi Grant Bounty: Analytics-Friendly Data Dumps

DeWi Grant Bounty: Analytics-Friendly Data Dumps

Description

Open-source, easily-accessible datasets have been shown to spurn innovation in fields like machine learning and data visualization. While the DeWi Metabase platform is suitable for ad hoc analyses and dashboards, queries can be slow and rate-limited. On the other hand, blockchain-etl allows for direct programmatic access to the entire ledger, but extensive hardware requirements and the lengthy initial sync time present a significant barrier to entry.

This grant serves a gap in Helium’s existing data engineering infrastructure by producing periodic static data dumps of key tables over discrete time periods.

The offering will accelerate exploratory analyses related to anti-gaming, token flow, coverage mapping, data usage, and more by enabling data scientists to interact with manageable datasets locally, in the programming language of their choice.

Deliverables

  • Specifically, this could be as simple as a cron job that queries the DeWi ETL database directly, extracts important tables (e.g. gateway-inventory, last 50k blocks worth of payment transactions), and uploads them in JSON/CSV format to an S3 bucket. It should be easy for us to adjust the queries and their frequency.
  • Add these public links to a static site for download.

Contact

Evan Diewald

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment