Skip to content

Instantly share code, notes, and snippets.

@Nithilaa
Created September 20, 2021 03:17
Show Gist options
  • Save Nithilaa/1608ac0a1125254d04d23c56bf52b3db to your computer and use it in GitHub Desktop.
Save Nithilaa/1608ac0a1125254d04d23c56bf52b3db to your computer and use it in GitHub Desktop.
! wget http://mlg.ucd.ie/files/datasets/bbc-fulltext.zip
! unzip bbc-fulltext.zip
project_path = "/text_summarization/"
documents = []
documents_dir = Path('/content/bbc/entertainment')
for file_path in documents_dir.files('*.txt'):
with file_path.open(mode='rt', encoding='utf-8') as fp:
documents.append(fp.readlines())
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment