Skip to content

Instantly share code, notes, and snippets.

@mgrandi
Last active October 28, 2020 07:15
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save mgrandi/3c2200c6435c33b21a18fd32cc1eb871 to your computer and use it in GitHub Desktop.
Save mgrandi/3c2200c6435c33b21a18fd32cc1eb871 to your computer and use it in GitHub Desktop.
wget-at args
./wget-at \
--user-agent "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:84.0) Gecko/20100101 Firefox/84.0" \
--page-requisites \
--no-verbose \
--input-file=/home/mgrandi/psstore/wget_at-url_list_{}-{}.txt \
--output-file=/home/mgrandi/psstore/wget-at_output.log \
--no-cookies \
--content-on-error \
--no-check-certificate \
--output-document=/home/mgrandi/psstore/wget.tmp \
-e "robots=off" \
--rotate-dns \
--no-parent \
--timeout=30 \
--tries inf \
--waitretry=10 \
--warc-file=/home/mgrandi/psstore/ps_store_2020_oct_scrape_{}-{} \
--warc-header="operator: mgrandi" \
--warc-header="description: playstation store en-US scrape 2020-10-27" \
--warc-dedup-url-agnostic \
--warc-max-size=5368709120 \
--reject-regex "{}"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment