Skip to content

Instantly share code, notes, and snippets.

@anjijava16
Created September 22, 2017 18:29
Show Gist options
  • Save anjijava16/354c11308c5a323301c22eff9d222906 to your computer and use it in GitHub Desktop.
Save anjijava16/354c11308c5a323301c22eff9d222906 to your computer and use it in GitHub Desktop.
There are 3 ways of processing xml files in Hadoop:-
1. PIG:- Using classes from Piggybank jar file.
2. HIVE:- Using SerDe (Serialization Deserialization) Method.
3. MapRedude Coding:- Lengthy coding using classes from OOXML jar files
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment