@jcleveley-zz
Last active August 29, 2015 14:15
Log analysis
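# www.bbc.co.uk is the input log file. Cut each line at the first & ? # ' or %, drop spaces, keep /news/ URLs, then count unique URLs (most frequent first).
pv www.bbc.co.uk | sed -E -e "s/[&?#'%].*$//" -e "s/ //g" | ack '^www\.bbc\.co\.uk/news/' | sort | uniq -c | sort -nr > news_unique.log
# Replace the leading count and host with " - " so each path becomes a YAML list item.
sed -E 's/^.+www\.bbc\.co\.uk/ - /' news_unique.log > news_unique_url.yaml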
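
The two commands above can be tried on a small sample before running them against a real log. The sketch below is only an illustration under a few assumptions: the sample lines and the file name sample.log are made up, each log line is assumed to hold one URL without a scheme, and grep and a plain file argument stand in for ack and pv so that nothing beyond standard tools is needed.

```shell
# Work in a throwaway directory so no existing files are overwritten.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical sample log: one URL per line, no scheme.
printf '%s\n' \
  'www.bbc.co.uk/news/uk-123?ocid=foo' \
  'www.bbc.co.uk/news/uk-123#share' \
  'www.bbc.co.uk/news/world-456' \
  'www.bbc.co.uk/sport/football' \
  'www.bbc.co.uk/news/uk-123' > sample.log

# Step 1: cut each line at the first & ? # ' or %, remove spaces,
# keep only /news/ URLs, then count duplicates (highest count first).
sed -E -e "s/[&?#'%].*$//" -e 's/ //g' sample.log \
  | grep '^www\.bbc\.co\.uk/news/' \
  | sort | uniq -c | sort -nr > news_unique.log

# Step 2: replace the count and host with " - " to get YAML list items.
sed -E 's/^.+www\.bbc\.co\.uk/ - /' news_unique.log > news_unique_url.yaml

cat news_unique_url.yaml
# prints:
#  - /news/uk-123
#  - /news/world-456
```

The /sport/ line is filtered out, and the three variants of /news/uk-123 collapse into one entry that sorts first because it has the highest count.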