Skip to content

Instantly share code, notes, and snippets.

@abhishek-shrm
Created August 3, 2020 16:45
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save abhishek-shrm/8fcadc5b1bd94b33696402367e52f402 to your computer and use it in GitHub Desktop.
Save abhishek-shrm/8fcadc5b1bd94b33696402367e52f402 to your computer and use it in GitHub Desktop.
df=dd.read_table('msmarco-docs.tsv',blocksize=100e6,header=None)
df.columns=['docid','url','title','body']
df.head()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment