Skip to content

Instantly share code, notes, and snippets.

@milindjagre
Created May 9, 2017 14:46
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save milindjagre/4faf7073f9fe89b9137a78012fe44be0 to your computer and use it in GitHub Desktop.
Save milindjagre/4faf7073f9fe89b9137a78012fe44be0 to your computer and use it in GitHub Desktop.
this input CSV file is used to demonstrate how to remove duplicate tuples from pig relation
SFO 2008 1 1 90 100 65
LAX 2008 1 2 89 111 67
SFO 2008 1 1 90 100 65
LAX 2008 1 2 89 111 67
DEN 2008 1 3 88 123 67
LAX 2009 10 1 12 132 34
DEN 2007 12 12 90 111 11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment