# scrape the html files (it's gentle, don't worry)
mkdir court-records
curl https://www.justice.gov/epstein/court-records > court-records/index.html
mkdir doj-disclosures
curl https://www.justice.gov/epstein/doj-disclosures > doj-disclosures/index.html
mkdir foia
curl https://www.justice.gov/epstein/foia > foia/index.html
# translate the html files to lists of pdf links
grep -o -E "https?://[][[:alnum:]._~:/?#@awk&'()*+,;%=-]+\.pdf" court-records/index.html | tr " " "\n" | sed 's/\&amp;/\&/g' > court-records/links.txt
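The link-extraction step can be wrapped in a small function so the same pipeline runs against each saved index page. This is only a sketch under assumptions: `extract_pdf_links` is a hypothetical helper name, the sed step is assumed to be meant to decode `&amp;` back to `&`, the dot before `pdf` is escaped so it matches a literal dot, and `sort -u` is an addition that drops duplicate links.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the original script): print every PDF URL
# found in a saved HTML file, one per line.
extract_pdf_links() {
  # $1: path to an HTML file, e.g. court-records/index.html
  grep -o -E "https?://[][[:alnum:]._~:/?#@awk&'()*+,;%=-]+\.pdf" "$1" \
    | sed 's/&amp;/\&/g' \
    | sort -u   # assumption: duplicate links are not wanted
}

# Possible usage, assuming the three index.html files were already downloaded:
#   for section in court-records doj-disclosures foia; do
#     extract_pdf_links "$section/index.html" > "$section/links.txt"
#   done
```

Keeping the pattern in one place means a fix to the regular expression applies to all three sections at once.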
# Place me in the same directory as your copyparty.exe from https://github.com/9001/copyparty
# Use me by right clicking and selecting "Run with PowerShell"
# For some reason PowerShell devs decided the iwr progress bar should update every byte... which makes this download go from instant to taking like 1 minute. Stupid stupid stupid devs.
# https://stackoverflow.com/questions/28682642/powershell-why-is-using-invoke-webrequest-much-slower-than-a-browser-download
$ProgressPreference = 'SilentlyContinue'
# Set this to the path of the conf file. If it's in the same directory as copyparty.exe, then just put the name of the file.
$ConfPath = "party.conf"