Skip to content

Instantly share code, notes, and snippets.

@snoremac
Created January 5, 2014 22:07
Show Gist options
  • Save snoremac/8274576 to your computer and use it in GitHub Desktop.
Save snoremac/8274576 to your computer and use it in GitHub Desktop.
An example of prototyping Hadoop streaming using command line utilities.
./src/ruby/common_crawl_input.rb 2>/dev/null \
| ./src/ruby/common_crawl_mapper.rb hello,world \
| sort -t$'\t' -k1 \
| ./src/ruby/common_crawl_reducer.rb
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment