Skip to content

Instantly share code, notes, and snippets.


Block or report user

Report or block VedAustin

Hide content and notifications from this user.

Learn more about blocking users

Contact Support about this user’s behavior.

Learn more about reporting abuse

Report abuse
View GitHub Profile
VedAustin /
Last active Mar 24, 2018
How to join disparate data sources and map the customer journey through various touch points
from pyspark.sql import functions as F
from pyspark.sql import Window
# Read data
user_guid_email ="/mnt/public-blobs/attribution-modelling/data2/id-maps/id-map-email.json")
user_guid_paid_search ="/mnt/public-blobs/attribution-modelling/data2/id-maps/id-map-paid-search.json")
user_guid_social ="/mnt/public-blobs/attribution-modelling/data2/id-maps/id-map-social.json")
guid_event_email ="/mnt/public-blobs/attribution-modelling/data2/events-email")
guid_event_paid_search ="/mnt/public-blobs/attribution-modelling/data2/events-paid-search")
You can’t perform that action at this time.