Skip to content

Instantly share code, notes, and snippets.

@noraj
Created April 26, 2019 18:52
Show Gist options
  • Star 1 You must be signed in to star a gist
  • Fork 1 You must be signed in to fork a gist
  • Save noraj/86df35096aa250ead4f7b3a6e6eb09de to your computer and use it in GitHub Desktop.
Save noraj/86df35096aa250ead4f7b3a6e6eb09de to your computer and use it in GitHub Desktop.
crawl a web page, extract all domains and resolve them to IP addresses with bash and common GNU/Linux tools
#!/bin/bash
url='rawsec.ml'
domains=$(curl $url -s | grep -E 'https?://[^"]*' | cut -d '/' -f 3 | cut -d '"' -f 1 | uniq)
filename='/tmp/temporary_ips.txt'
for domain in $domains
do
dig +noall +answer $domain | awk '/\sA\s/ {print $5}' >> $filename
done
cat $filename | sort -u
@noraj
Copy link
Author

noraj commented Apr 26, 2019

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment