@ratchetwrench
Created August 9, 2017 00:23
Convert CSV to Parquet file
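import dask.dataframe as dd

# '*' matches every file with the 'csv/yellow_tripdata_2015-' prefix and a '.csv' suffix
df = dd.read_csv('csv/yellow_tripdata_2015-*.csv')
# Needs a Parquet engine such as pyarrow; the output path is a directory of part files
df.to_parquet('yellow_tripdata.parquet')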