Skip to content

Instantly share code, notes, and snippets.

@krsnewwave
Created March 11, 2022 15:04
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save krsnewwave/8fa5488faa9d54c2afd58d80ae418ee8 to your computer and use it in GitHub Desktop.
Save krsnewwave/8fa5488faa9d54c2afd58d80ae418ee8 to your computer and use it in GitHub Desktop.
kedro catalog yaml with mlflow
# in <root>/conf/base/catalog.yaml
insurance:
type: pandas.CSVDataSet
filepath: data/01_raw/train.csv
layer: raw
model_input_table:
type: pandas.ParquetDataSet
filepath: data/03_primary/model_input_table.pq
layer: primary
xgboost_pipe.clf:
type: kedro_mlflow.io.artifacts.MlflowArtifactDataSet
data_set:
type: kedro_mlflow.io.models.MlflowModelSaverDataSet
flavor: mlflow.sklearn
filepath: data/06_models/xgboost_clf.pickle
# metrics for mlflow
xgboost_pipe.model_metrics:
type: kedro_mlflow.io.metrics.MlflowMetricsDataSet
prefix: metrics
# objects to upload in mlflow
xgboost_pipe.roc_graph:
type: kedro_mlflow.io.artifacts.MlflowArtifactDataSet
data_set:
type: kedro.extras.datasets.matplotlib.MatplotlibWriter
filepath: data/06_models/xgb.roc_plot.png
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment