This comment has been minimized.
This comment has been minimized.Show comment Hide comment
I need to write Scala program combined data from the two different files.
Each line also has another id called 'page view id'. This id is specified by json key 'pv'. You can find json key values similar to "pv":"7963ee21-0d09-4924-b315-ced4adad425f" in both the files.
The aim is to join the data in two files using "pv". You need to parse these files and combine the data in two files to check how many asset impressions, views and clicks are present in both the files for each page view id. the input files are .gz.
If it possible can you help me regarding this task?