Skip to content

Instantly share code, notes, and snippets.

@navarasu
Last active October 13, 2019 12:57
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save navarasu/7cc72118d8d88a8fa5ad6e18e969c5b2 to your computer and use it in GitHub Desktop.
Save navarasu/7cc72118d8d88a8fa5ad6e18e969c5b2 to your computer and use it in GitHub Desktop.
Pre process training data
#Prepare event matrix
input_products = sorted(PRODUCTS[PRODUCTS !='Product A'])
event_matrix=generate_event_matrix_index(processed_data,input_products)
x_train_temp=x_train.apply(lambda x : pd.Series(util.encode_events(event_matrix,x[input_products].dropna().items())) ,axis=1)
x_train=x_train_temp.join(x_train[['cus_type','cus_point']])
#Preprocess categorial data
cus_type = CategoricalDtype(categories=processed_data['cus_type'].unique(), ordered=True)
x_train['cus_type']=x_train['cus_type'].astype(cus_type)
x_train=pd.get_dummies(x_train,prefix='cus')
#Preprocess continuous data
cus_point_scaler = MinMaxScaler()
x_train["cus_point"]=cus_point_scaler.fit_transform(x_train[["cus_point"]])
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment