@alexanderlz
Created July 9, 2012 14:51
One-liner to find which logs weren't updated in HDFS today
diff <(hadoop fs -ls /user/mapred/ | cut -s -f4 -d'/' | sort -u) <(hadoop fs -ls /user/mapred/*/*_data/$(date +%Y%m%d)* | cut -s -f4 -d'/' | sort -u) | grep '^<' | cut -c3-
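The same comparison can also be sketched with `comm` instead of `diff`, which avoids parsing the `<` markers. This is only a sketch under assumptions: the function names `log_names` and `stale_logs` are hypothetical, and it assumes every path has the form `/user/mapred/<name>/...`, so that `<name>` is the fourth `/`-separated field, as in the one-liner.

```shell
#!/usr/bin/env bash
# Sketch only: log_names and stale_logs are hypothetical helper names.

# Read `hadoop fs -ls`-style output on stdin and print the unique
# 4th '/'-separated field of each path.
# cut -s drops lines that contain no '/', such as "Found N items".
log_names() {
  cut -s -d'/' -f4 | LC_ALL=C sort -u
}

# Print the names found in listing file $1 that are absent from listing file $2.
# comm -23 keeps only the lines unique to its first (sorted) input.
stale_logs() {
  LC_ALL=C comm -23 <(log_names < "$1") <(log_names < "$2")
}

# Usage against a live cluster (not executed here); the glob is quoted so the
# local shell does not try to expand it:
#   stale_logs <(hadoop fs -ls /user/mapred/) \
#              <(hadoop fs -ls "/user/mapred/*/*_data/$(date +%Y%m%d)*")
```

Because both functions read plain listings, they can be tried on saved `hadoop fs -ls` output before being pointed at a cluster.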