Skip to content

Instantly share code, notes, and snippets.

@dkn22
Last active March 15, 2018 23:07
Show Gist options
  • Save dkn22/d2762e8899192d77510f31d620a3895e to your computer and use it in GitHub Desktop.
Save dkn22/d2762e8899192d77510f31d620a3895e to your computer and use it in GitHub Desktop.
Using embedder to transform categorical variables
from embedder.regression import Embedder
from sklearn.pipeline import Pipeline
from xgboost import XGBRegressor
embedder = Embedder(embedding_dict)
embedder.fit(X_encoded, y)
X_embedded = embedder.transform(X)
# X_embedded = embedder.fit_transform(X, y)
# using sklearn pipeline
pipeline = Pipeline(
[('embedding', Embedder(embedding_dict)),
('xgboost', XGBRegressor())]
)
pipeline.fit(X_encoded, y)
predictions = pipeline.predict(X_test, y_test)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment