Skip to content

Instantly share code, notes, and snippets.

View srinivasanHadoop's full-sized avatar

srinivasan srinivasanHadoop

View GitHub Profile
@srinivasanHadoop
srinivasanHadoop / TikaFileInputFormat.java
Created October 31, 2013 06:28
i integrated the apache tika with Hadoop mapreduce code
package com.srini.tikacustom;
import java.io.IOException;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.InputSplit;
import org.apache.hadoop.mapreduce.RecordReader;
import org.apache.hadoop.mapreduce.TaskAttemptContext;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;