Skip to content

Instantly share code, notes, and snippets.

@davidsnyder
Created December 21, 2011 20:54
Show Gist options
  • Save davidsnyder/1507662 to your computer and use it in GitHub Desktop.
Save davidsnyder/1507662 to your computer and use it in GitHub Desktop.
Swineherd test

‘foo.pig.erb’:

fips = LOAD '<%= in_path %>' AS (fips_id:int,state_name:chararray);
DUMP fips;

Ruby irb interpreter:

require 'swineherd'
script = Swineherd::Script.new('foo.pig.erb',{"in_path" => "fips_to_state.tsv"}) 
script.run(:run_mode => :local,:combine_splits => false,:map_tasks => 10,:param => {:foo => "bar"}) 
--> ENV['PIG_OPTS'] = '-Dmapred.map.tasks=10 -Dpig.splitCombination=false'
--> /usr/local/share/pig/bin/pig -x local -p FOO=bar /tmp/5575-1324500132-foo.pig

cli:

swineherd --combine_splits=false \
--map_tasks=10 --pig.param.foo=bar \
--script.var.in_path='fips_to_state.tsv' foo.pig.erb

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment