Skip to content

Instantly share code, notes, and snippets.

@jspacker
Last active December 15, 2015 16:09
Show Gist options
  • Save jspacker/5286649 to your computer and use it in GitHub Desktop.
Save jspacker/5286649 to your computer and use it in GitHub Desktop.
twitter-pagerank controlscript: a preprocessing step
print "Starting preprocessing step."
preprocess = Pig.compileFromFile(self.preprocessing_script)
preprocess_params = {
"INPUT_PATH": self.edges_input,
"PAGERANKS_OUTPUT_PATH": self.preprocess_pageranks,
"NUM_NODES_OUTPUT_PATH": self.preprocess_num_nodes
}
preprocess_bound = preprocess.bind(preprocess_params)
preprocess_stats = preprocess_bound.runSingle()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment