In this tutorial you will gain a working knowledge of Pig through the hands-on experience creating Pig scripts to carry out essential data operations and tasks.
We will first read in two data files that contain New York Stock Exchange dividend prices and stock prices, and then use these files to perform a number of Pig operations including:
- Define a relation with and without
schema
- Define a new relation from an
existing relation
Select
specific columns from within a relationJoin
two relations- Sort the data using
‘ORDER BY’