Skip to content

Instantly share code, notes, and snippets.

@pbinkley
Created September 27, 2013 18:04
Show Gist options
  • Save pbinkley/6732586 to your computer and use it in GitHub Desktop.
Save pbinkley/6732586 to your computer and use it in GitHub Desktop.
Extracting file from warc by url with warctools
warcindex drupalib.interoperating.info.warc > drupalib.interoperating.info.warc.csv
warcpayload drupalib.interoperating.info.warc:`grep http://drupalib.interoperating.info/files/screencaps/nash-library-thumbnail.jpg drupalib.interoperating.info.warc.csv | grep " response " | head -1 | awk '{ print $2; }'` > nash-library-thumbnail.jpg
identify nash-library-thumbnail.jpg
nash-library-thumbnail.jpg JPEG 320x320 320x320+0+0 8-bit sRGB 21.2KB 0.000u 0:00.000
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment