Skip to content

Instantly share code, notes, and snippets.

@thicolares
Created December 27, 2018 23:07
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save thicolares/227035cbe22a7f3064e4e3a131351de2 to your computer and use it in GitHub Desktop.
Save thicolares/227035cbe22a7f3064e4e3a131351de2 to your computer and use it in GitHub Desktop.
Download an entire Fotolog profile as of 2018 using wget

Run:

$ wget -r -l5 -E -e robots=off --convert-links --no-parent https://fotolog.com/username/

-r Turn on recursive retrieving.

-l5 Specify recursion maximum depth level depth. 5 is more than necessary, but the --no-parent keeps it under control.

-E (or --adjust-extension) Add the correct extensions to HTML and CSS files.

-e robots=off wget by default honours the robots.txt standard for crawling pages, so we turn it off.

---convert-links AFTER the download is complete, convert the links in the document to make them suitable for local viewing.

--no-parent Don´t download something from the parent directory.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment