@wavescholar
Last active August 29, 2015 14:15
HDFS Compression Code Snippet
// Other options for CompressionType are RECORD and BLOCK. I'm not sure that BLOCK will work with any codec other than bzip2.
CompressionType compressionType = CompressionType.NONE;

// Declare this enum at class level; the original line was missing the `enum` keyword.
enum compressionCodecEnum { gzip, bzip2, none }

compressionCodecEnum compressionCodecType = compressionCodecEnum.bzip2;

// Assumption: filePath, keyClass and valueClass are SequenceFile.Writer.Option values
// (built with SequenceFile.Writer.file(...), keyClass(...) and valueClass(...)),
// which is what createWriter(Configuration, Writer.Option...) expects.
if (compressionCodecType == compressionCodecEnum.bzip2)
{
    // Create the codec through ReflectionUtils so it receives the Configuration;
    // a bare `new BZip2Codec()` has no conf and may fail with a NullPointerException on some Hadoop versions.
    CompressionCodec codec = org.apache.hadoop.util.ReflectionUtils.newInstance(BZip2Codec.class, conf);
    org.apache.hadoop.io.SequenceFile.Writer.Option optCom = SequenceFile.Writer.compression(CompressionType.BLOCK, codec);
    sequenceFileWriter = SequenceFile.createWriter(conf, filePath, keyClass, valueClass, optCom);
}
if (compressionCodecType == compressionCodecEnum.gzip)
{
    CompressionCodec codec = org.apache.hadoop.util.ReflectionUtils.newInstance(GzipCodec.class, conf);
    org.apache.hadoop.io.SequenceFile.Writer.Option optCom = SequenceFile.Writer.compression(CompressionType.RECORD, codec);
    sequenceFileWriter = SequenceFile.createWriter(conf, filePath, keyClass, valueClass, optCom);
}
if (compressionCodecType == compressionCodecEnum.none)
{
    // No compression option: the writer falls back to its default settings.
    sequenceFileWriter = SequenceFile.createWriter(conf, filePath, keyClass, valueClass);
}
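The three `if` blocks above differ only in which codec and which compression type they pass along. One possible way to keep that pairing in a single place is a small lookup enum. The sketch below is not part of the original gist: `CodecSelection`, `CodecChoice` and `parse` are hypothetical names, and the codec and compression type are stored as plain strings so the example runs without Hadoop on the classpath. The strings match the Hadoop class names and the `CompressionType` constants used in the snippet.

```java
import java.util.Locale;

public class CodecSelection {

    // Hypothetical stand-in for the gist's compressionCodecEnum: each constant
    // carries the codec class name and the CompressionType name it is paired with above.
    enum CodecChoice {
        GZIP("org.apache.hadoop.io.compress.GzipCodec", "RECORD"),
        BZIP2("org.apache.hadoop.io.compress.BZip2Codec", "BLOCK"),
        NONE(null, "NONE");

        final String codecClassName;   // null means "no codec"
        final String compressionType;  // name of a SequenceFile.CompressionType constant

        CodecChoice(String codecClassName, String compressionType) {
            this.codecClassName = codecClassName;
            this.compressionType = compressionType;
        }

        // Case-insensitive lookup, e.g. from a command-line flag or a config value.
        static CodecChoice parse(String name) {
            return valueOf(name.trim().toUpperCase(Locale.ROOT));
        }
    }

    public static void main(String[] args) {
        // Print the pairing for every choice.
        for (CodecChoice choice : CodecChoice.values()) {
            System.out.println(choice + " -> " + choice.compressionType + " / " + choice.codecClassName);
        }
    }
}
```

In real Hadoop code the string fields would presumably be replaced by the codec `Class` object and the `CompressionType` constant, so that a single `createWriter` call could serve all three cases.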