Skip to content

Instantly share code, notes, and snippets.

View snorkysnark's full-sized avatar

Francis The Basilisk snorkysnark

  • Novi Sad, Serbia
View GitHub Profile
@snorkysnark
snorkysnark / flatten-openalex-jsonl.py
Last active June 20, 2022 10:39 — forked from richard-orr/flatten-openalex-jsonl.py
flatten openalex JSON Lines files to CSV readable by PostgreSQL
import csv
import glob
import gzip
import json
import os
SNAPSHOT_DIR = 'openalex-snapshot'
CSV_DIR = 'csv-files'
FILES_PER_ENTITY = int(os.environ.get('OPENALEX_DEMO_FILES_PER_ENTITY', '0'))