Skip to content

Instantly share code, notes, and snippets.

@lhagan
Created October 13, 2012 18:07
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save lhagan/3885597 to your computer and use it in GitHub Desktop.
Save lhagan/3885597 to your computer and use it in GitHub Desktop.
Bash script to parse Apache log for a count of RSS subscribers and email it to you
#!/bin/sh -e
# --- Required variables ---
RSS_URI="/rss.xml"
MAIL_TO="your@email.com"
LOG_FILE="/home/$USER/var/log/lighttpd/access.log"
LOG_DATE_FORMAT="%d/%b/%Y"
# --- Optional customization ---
MAIL_SUBJECT="RSS feed subscribers"
# Date expression for yesterday
DATE=$(($(date +%s) - 86400))
# Locale for printf number formatting (e.g. "10000" => "10,000")
LANG=en_US
# Date format for display in emails
HUMAN_FDATE=`date -r "$DATE" "+%Y-%m-%d"`
# --- The actual log parsing ---
LOG_FDATE=`date -r "$DATE" "+${LOG_DATE_FORMAT}"`
DAY_BEFORE_FDATE=`date -r $(($DATE - 86400)) "+${LOG_DATE_FORMAT}"`
# Unique IPs requesting RSS, except those reporting "subscribers":
IPSUBS=`fgrep "$LOG_FDATE" "$LOG_FILE" | fgrep " $RSS_URI" | egrep -v '[0-9]+ subscribers' | cut -d' ' -f 1 | sort | uniq | wc -l`
# Google Reader subscribers and other user-agents reporting "subscribers" and using the "feed-id" parameter for uniqueness:
# `tac` replacement per http://tipstricks.itmatrix.eu/?p=305
GRSUBS=`egrep "($LOG_FDATE|$DAY_BEFORE_FDATE)" "$LOG_FILE" | fgrep " $RSS_URI" | egrep -o '[0-9]+ subscribers; feed-id=[0-9]+' | sort -t= -k2 -s | awk '{print NR,$0}' | sort -nr | sed 's/^[0-9]* //' | uniq -f2 | awk '{s+=$1} END {print s}'`
# Other user-agents reporting "subscribers", for which we'll use the entire user-agent string for uniqueness:
OTHERSUBS=`fgrep "$LOG_FDATE" "$LOG_FILE" | fgrep " $RSS_URI" | fgrep -v 'subscribers; feed-id=' | egrep '[0-9]+ subscribers' | egrep -o '"[^"]+"$' | sort -t\( -k2 -sr | awk '!x[$1]++' | egrep -o '[0-9]+ subscribers' | awk '{s+=$1} END {print s}'`
REPORT=$(
printf "Feed stats for $HUMAN_FDATE:\n\n"
printf "%8d Google Reader subscribers\n" $GRSUBS
printf "%8d subscribers from other aggregators\n" $OTHERSUBS
printf "%8d direct subscribers\n" $IPSUBS
echo "--------"
printf "%8d total subscribers\n" $(($GRSUBS + $OTHERSUBS + $IPSUBS))
)
echo "$REPORT"
echo ""
echo "Also emailed to $MAIL_TO."
echo "$REPORT " | mail -s "[$HUMAN_FDATE] $MAIL_SUBJECT" $MAIL_TO
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment