Skip to content

Instantly share code, notes, and snippets.

@mallipeddi
Created August 18, 2009 14:15
Show Gist options
  • Save mallipeddi/169730 to your computer and use it in GitHub Desktop.
Save mallipeddi/169730 to your computer and use it in GitHub Desktop.
# checkout the LZO-related code from Google Code
svn checkout http://hadoop-gpl-compression.googlecode.com/svn/trunk/ hadoop-gpl-compression
# build the Java code
cd hadoop-gpl-compression
ant
# move the jar file to $HADOOP_HOME/lib
mv build/hadoop-gpl-compression-0.1.0-dev.jar $HADOOP_HOME/lib/
# build the LZO native C code
cd src/native
sudo apt-get install liblzo2-dev
# indicate if your JVM is 32-bit or 64-bit
export JVM_DATA_MODEL=32
./configure
make
cp -rf .libs/libgplcompression.* $HADOOP_HOME/lib/Linux-i386-32/
# now you're good to go. launch a test job to verify if LZO compression is working fine.
$HADOOP_HOME/bin/hadoop jar $HADOOP_HOME/hadoop-0.20.0-examples.jar randomtextwriter -Dtest.randomtextwrite.bytes_per_map=104857600 -Dtest.randomtextwrite.maps_per_host=3 -Dmapred.output.compress=true -Dmapred.output.compression.type=BLOCK -Dmapred.output.compression.codec=com.hadoop.compression.lzo.LzopCodec randomtext_BLOCK_LzopCodec
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment