The last [gist] (gist.github.com/karanth/8736340) was about installing the HortonWorks sandbox and getting to know the entry points into the sandbox. A next step for most people who are starting to learn Hadoop is to either run the example MapReduce (MR) jobs that come with the Hadoop distribution, or to write a simple MR job like word count.
The HUE web page at http://localhost:8888
does not allow for execution of Hadoop MR jobs from java programs, without the use of higher level abstractions like HIVE (SQL-like) or Pig. Hadoop MR jobs can be run by logging into the sandbox (recall Alt + F5
or ssh
) and executing jobs on the sandbox's terminal.
####Running Hadoop Example Programs
The sandbox has the hadoop MR examples in the directory /usr/lib/hadoop-mapreduce
. The file name is of the form, hadoop-mapreduce-examples-*.jar
. * (asterisk) is the wildcard for the version details of the jar file.
To run an example, the pi estimation program in this case, the command is,