Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Save ResidentMario/f6cd4e53c369b9d0de5a8b1eb6087067 to your computer and use it in GitHub Desktop.
Save ResidentMario/f6cd4e53c369b9d0de5a8b1eb6087067 to your computer and use it in GitHub Desktop.
---------------------------------------------------------------------------
IndexError Traceback (most recent call last)
<ipython-input-22-4c19f3a1c827> in <module>
----> 1 import dask.dataframe as dd; dd.read_parquet("s3://2019-nyc-taxi-trips/*/data.parquet", storage_options={"key": "AKIAVKTT2PHGKDQIERXC", "secret": "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"}, engine="fastparquet")
~/opt/miniconda3/envs/dask-local-test-env/lib/python3.7/site-packages/dask/dataframe/io/parquet/core.py in read_parquet(path, columns, filters, categories, index, storage_options, engine, gather_statistics, split_row_groups, chunksize, **kwargs)
234 filters=filters,
235 split_row_groups=split_row_groups,
--> 236 **kwargs
237 )
238
~/opt/miniconda3/envs/dask-local-test-env/lib/python3.7/site-packages/dask/dataframe/io/parquet/fastparquet.py in read_metadata(cls, fs, paths, categories, index, gather_statistics, filters, **kwargs)
206 # correspond to a row group (populated below).
207 parts, pf, gather_statistics, fast_metadata, base_path = _determine_pf_parts(
--> 208 fs, paths, gather_statistics, **kwargs
209 )
210
~/opt/miniconda3/envs/dask-local-test-env/lib/python3.7/site-packages/dask/dataframe/io/parquet/fastparquet.py in _determine_pf_parts(fs, paths, gather_statistics, **kwargs)
145 pf.cats = paths_to_cats(fns, scheme)
146 parts = paths.copy()
--> 147 elif fs.isdir(paths[0]):
148 # This is a directory, check for _metadata, then _common_metadata
149 paths = fs.glob(paths[0] + fs.sep + "*")
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment