Skip to content

Instantly share code, notes, and snippets.

@rikturr
Created July 21, 2020 14:34
Show Gist options
  • Save rikturr/63bff03968a6249523bd41ac094338fa to your computer and use it in GitHub Desktop.
Save rikturr/63bff03968a6249523bd41ac094338fa to your computer and use it in GitHub Desktop.
load dask
import dask.dataframe as dd
taxi = dd.read_csv(
's3://nyc-tlc/trip data/yellow_tripdata_2019-01.csv',
parse_dates=['tpep_pickup_datetime', 'tpep_dropoff_datetime'],
).sample(frac=0.1, replace=False)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment