Skip to content

Instantly share code, notes, and snippets.

@IvanNardini
Created June 7, 2020 15:51
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save IvanNardini/2e9b69ac33a9d474e9e6a0d41d87d874 to your computer and use it in GitHub Desktop.
Save IvanNardini/2e9b69ac33a9d474e9e6a0d41d87d874 to your computer and use it in GitHub Desktop.
MLOps series #1 : Batch scoring with Mlflow Model (Mleap flavor) on Google Cloud Platform
#! bin/bash
#Pass CLUSTER_NAME, REGION AND BUCKET parameters (or use default parameters)
CLUSTER_NAME=${1:-cluster-00000}
REGION=${2:-europe-west6}
BUCKET=${3:-cloud-demo-databrick-gcp}
#Run job
gcloud dataproc jobs submit pyspark \
--cluster ${CLUSTER_NAME} \
--region ${REGION} \
gs://${BUCKET}/2_model/score.py --input "gs://cloud-demo-databrick-gcp/1_data/boston_house_prices_toscore.csv" --output "gs://cloud-demo-databrick-gcp/1_data/boston_house_prices_scored.csv"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment