Skip to content

Instantly share code, notes, and snippets.

@raingloom
Last active May 30, 2016 14:32
Show Gist options
  • Save raingloom/19f62b3aca431d01d8fbac1143314de2 to your computer and use it in GitHub Desktop.
Save raingloom/19f62b3aca431d01d8fbac1143314de2 to your computer and use it in GitHub Desktop.
python 2 href extractor
#downloads some word lists with wget
#cat useragent
#Mozilla/5.0 (X11; Linux x86_64; rv:43.0) Gecko/20100101 Firefox/43.0 Iceweasel/43.0.4
mkdir -p files
python getlinks.py test.xml | wget --verbose --base 'http://www.aciddr0p.net' --input-file=- --directory-prefix=files -nd --show-progress --user-agent="$(cat useragent)"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment