Skip to content

Instantly share code, notes, and snippets.

View tonyahowe's full-sized avatar

Tonya Howe tonyahowe

View GitHub Profile
@mjlavin80
mjlavin80 / get_eebo_tcp.py
Last active May 21, 2023 15:28
Download all Github-archived EEBO-TCP xml files from their associated repositories on Github
# Download all Github-archived EEBO-TCP xml files from their associated repositories on Github
# Files were created "by converting TCP files to TEI P5 using tcp2tei.xsl,TEI @ Oxford."
# Running this script requires two preparatory steps. Either could be eliminated with a simple modification
# 1. Creating a destination folder called tcp (all lowercase) that is placed in the same folder as this script
# 2. Downloading "TCP.csv" (all caps filename) from https://github.com/textcreationpartnership/Texts and placing it in the same folder as this script
import requests
import pandas as pd
# comment these lines out if you have the file already