Skip to content

Instantly share code, notes, and snippets.

@eliasdorneles
Last active April 20, 2018 10:33
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save eliasdorneles/e55c5b3df59bc9f0bcfa994d043b14ab to your computer and use it in GitHub Desktop.
Save eliasdorneles/e55c5b3df59bc9f0bcfa994d043b14ab to your computer and use it in GitHub Desktop.
Script to run spider, saving log output and stuff
#!/bin/bash
set -e
abort() {
echo "$*"; exit 1;
}
usage() {
echo $*
abort "Usage: $(basename $0) [-h|--help] SPIDER_FILE"
}
require() {
type $1 >/dev/null 2>/dev/null
}
while [ "${1#-}" != "$1" ]; do
case "$1" in
-h|--help) usage;;
*) usage;;
esac
shift
done
require scrapy || abort "Please install scrapy first"
[ -f "$1" ] || abort "Please provide a spider file"
spider="$1"
base=$(basename $1 .py)
output=visited_${spider}-$(date +%Y-%m-%d_%H%M).jl
echo "Running spider, output will be in $output"
(scrapy runspider $spider -o $output 2>&1) > log-$base.txt
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment