Skip to content

Instantly share code, notes, and snippets.

View imaurer's full-sized avatar

Ian Maurer imaurer

View GitHub Profile
@imaurer
imaurer / us_zips.csv
Created January 6, 2024 23:17 — forked from Tucker-Eric/us_zips.csv
US Zip Codes
We can't make this file beautiful and searchable because it's too large.
zip,city,state,state_abbr,county,count_code,latitude,longitude
99553,Akutan,Alaska,AK,Aleutians East,13,54.143,-165.7854
99571,Cold Bay,Alaska,AK,Aleutians East,13,55.1858,-162.7211
99583,False Pass,Alaska,AK,Aleutians East,13,54.841,-163.4368
99612,King Cove,Alaska,AK,Aleutians East,13,55.0628,-162.3056
99661,Sand Point,Alaska,AK,Aleutians East,13,55.3192,-160.4914
99546,Adak,Alaska,AK,Aleutians West,16,51.88,-176.6581
99547,Atka,Alaska,AK,Aleutians West,16,52.1224,-174.4301
99591,Saint George Island,Alaska,AK,Aleutians West,16,56.5944,-169.6186
99638,Nikolski,Alaska,AK,Aleutians West,16,52.9883,-168.7884
@imaurer
imaurer / us_zips.csv
Created January 6, 2024 23:17 — forked from Tucker-Eric/us_zips.csv
US Zip Codes
We can't make this file beautiful and searchable because it's too large.
zip,city,state,state_abbr,county,count_code,latitude,longitude
99553,Akutan,Alaska,AK,Aleutians East,13,54.143,-165.7854
99571,Cold Bay,Alaska,AK,Aleutians East,13,55.1858,-162.7211
99583,False Pass,Alaska,AK,Aleutians East,13,54.841,-163.4368
99612,King Cove,Alaska,AK,Aleutians East,13,55.0628,-162.3056
99661,Sand Point,Alaska,AK,Aleutians East,13,55.3192,-160.4914
99546,Adak,Alaska,AK,Aleutians West,16,51.88,-176.6581
99547,Atka,Alaska,AK,Aleutians West,16,52.1224,-174.4301
99591,Saint George Island,Alaska,AK,Aleutians West,16,56.5944,-169.6186
99638,Nikolski,Alaska,AK,Aleutians West,16,52.9883,-168.7884
"""coalesce_parquets.py
gist of how to coalesce small row groups into larger row groups.
Solves the problem described in https://issues.apache.org/jira/browse/PARQUET-1115
"""
from __future__ import annotations
from pathlib import Path
from typing import Callable, Iterable, TypeVar