Forked from tovbinm/Add LZO to Hadoop
Created February 19, 2013 07:31
On Ubuntu:
1. sudo apt-get install lzop liblzo2-dev and build:
3. Copy the resulting jar to <yourhadoop>/lib/, typically /usr/lib/hadoop/lib/
5. cp ./hadoop-gpl-compression-0.1.0/lib/native/Linux-<your_arch_type>/*.* /usr/lib/hadoop/lib/native/Linux-<your_arch_type>/
6. Add the following properties to core-site.xml:
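A sketch of step 6, assuming the two codec properties documented in the hadoop-lzo README (io.compression.codecs and io.compression.codec.lzo.class). The codec list in the first value is an assumption; keep whatever codecs your cluster already registers and append the two LZO classes. The script only writes a fragment to a hypothetical temporary path, to be merged by hand inside the <configuration> element of core-site.xml:

```shell
# Hypothetical output path; nothing here touches the real core-site.xml.
fragment="${TMPDIR:-/tmp}/lzo-core-site-fragment.xml"

# Quoted heredoc delimiter so the XML is written verbatim, with no expansion.
cat > "$fragment" <<'EOF'
<property>
  <name>io.compression.codecs</name>
  <value>org.apache.hadoop.io.compress.GzipCodec,org.apache.hadoop.io.compress.DefaultCodec,org.apache.hadoop.io.compress.BZip2Codec,com.hadoop.compression.lzo.LzoCodec,com.hadoop.compression.lzo.LzopCodec</value>
</property>
<property>
  <name>io.compression.codec.lzo.class</name>
  <value>com.hadoop.compression.lzo.LzoCodec</value>
</property>
EOF

echo "wrote $fragment"
```

Restart the Hadoop daemons after editing core-site.xml so the new codec list is picked up.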
echo "hello world" > test.log
lzop test.log
hadoop fs -copyFromLocal test.log.lzo /tmp
hadoop jar /usr/lib/hadoop/lib/hadoop-lzo.jar com.hadoop.compression.lzo.LzoIndexer /tmp/test.log.lzo
hadoop fs -libjars /app/hadoop/resources/conduit_data_types.jar,/app/hadoop/resources/json-rpc-1.0.jar -text /user/mapred/<compressedfile> > out.out
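Going back to step 5, the architecture suffix in the native library path can be derived rather than guessed. A minimal sketch, assuming the usual Hadoop naming of Linux-amd64-64 for 64-bit x86 and Linux-i386-32 for 32-bit x86; the function name hadoop_platform_suffix is hypothetical, and any other architecture is reported as unknown instead of being guessed:

```shell
# Map a machine name (as printed by `uname -m`) to the suffix used in
# Hadoop's native library directory name. Only the two common x86 cases
# are covered; this is an assumption, not an exhaustive table.
hadoop_platform_suffix() {
  case "$1" in
    x86_64|amd64) echo "amd64-64" ;;
    i386|i486|i586|i686) echo "i386-32" ;;
    *) echo "unknown" ;;
  esac
}

arch_type="$(hadoop_platform_suffix "$(uname -m)")"
echo "native dir: /usr/lib/hadoop/lib/native/Linux-${arch_type}/"
```

If the result is unknown, check which Linux-* directories actually exist under ./hadoop-gpl-compression-0.1.0/lib/native/ and pick the one matching your JVM.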