@daniarleagk
daniarleagk / .block
Last active May 10, 2021 14:09
Z-Order curve range query
license: mit
height: 1024
@daniarleagk
daniarleagk / .block
Last active March 7, 2021 09:26
Group median spark sql
license:
@daniarleagk
daniarleagk / .block
Last active March 14, 2020 11:37
Export and import data using Apache Spark from and into Apache HBase
license:
@daniarleagk
daniarleagk / .block
Last active March 14, 2020 11:36
Getting started example Apache Spark Streaming and Apache Kafka
license:
@daniarleagk
daniarleagk / .block
Created March 15, 2020 08:12
Spark Java log4j smtp appender and byte-monkey library
license:
@daniarleagk
daniarleagk / .md
Created October 30, 2024 06:51
DuckDB python pipeline example

NOTE: This example code is fictional and meant for illustration purposes only. DuckDB version: 1.1.2

We have the following scenario. We receive sales transaction data in batches:

transaction_id,trx_ts,item_id,total
0, "2024-07-31 20:00:00",i0_04_00,2
1, "2024-08-01 11:10:00",i0_01_00,2
2, "2024-08-01 11:20:00",i0_02_00,1