@ismaelc
Last active November 25, 2021 20:28
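import sagemaker
from sagemaker.processing import ProcessingInput, ProcessingOutput
from sagemaker.sklearn.processing import SKLearnProcessor

# Assumption: this runs where an execution role is attached (e.g. a SageMaker
# notebook). Elsewhere, set `role` to an IAM role ARN with SageMaker access.
role = sagemaker.get_execution_role()

sklearn_processor = SKLearnProcessor(
    framework_version='0.20.0',
    role=role,
    instance_type='ml.m5.xlarge',
    instance_count=1,
)

# Run preprocessing.py in the container: the S3 input is copied to the input
# path, and each output path is uploaded back to S3 when the job finishes.
sklearn_processor.run(
    code='preprocessing.py',
    inputs=[ProcessingInput(
        source='s3://path/to/my/input-data.csv',
        destination='/opt/ml/processing/input')],
    outputs=[
        ProcessingOutput(source='/opt/ml/processing/output/train'),
        ProcessingOutput(source='/opt/ml/processing/output/validation'),
        ProcessingOutput(source='/opt/ml/processing/output/test'),
    ],
)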
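
The run call above references preprocessing.py but the gist does not include it. The following is a minimal sketch of what such a script might look like, not the author's script. It assumes the input is a single CSV with a header row, and the 70/15/15 split, the seed, the helper names `split_rows` and `write_splits`, and the output file names are all hypothetical. Only the container paths come from the call above.

```python
import csv
import os
import random

# Container paths taken from the ProcessingInput/ProcessingOutput arguments.
INPUT_DIR = "/opt/ml/processing/input"
OUTPUT_DIR = "/opt/ml/processing/output"


def split_rows(rows, train_frac=0.7, validation_frac=0.15, seed=42):
    """Shuffle rows and cut them into train/validation/test lists.

    The fractions and seed are illustrative defaults (hypothetical).
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_train = round(len(shuffled) * train_frac)
    n_validation = round(len(shuffled) * validation_frac)
    return {
        "train": shuffled[:n_train],
        "validation": shuffled[n_train:n_train + n_validation],
        "test": shuffled[n_train + n_validation:],
    }


def write_splits(input_csv, output_dir):
    """Read one CSV with a header row and write one CSV per split."""
    with open(input_csv, newline="") as f:
        reader = csv.reader(f)
        header = next(reader)
        rows = list(reader)
    for name, part in split_rows(rows).items():
        split_dir = os.path.join(output_dir, name)
        os.makedirs(split_dir, exist_ok=True)
        with open(os.path.join(split_dir, name + ".csv"), "w", newline="") as f:
            writer = csv.writer(f)
            writer.writerow(header)
            writer.writerows(part)


if __name__ == "__main__" and os.path.isdir(INPUT_DIR):
    # Only runs inside the processing container, where INPUT_DIR exists.
    csv_files = sorted(p for p in os.listdir(INPUT_DIR) if p.endswith(".csv"))
    if csv_files:
        write_splits(os.path.join(INPUT_DIR, csv_files[0]), OUTPUT_DIR)
```

Writing each split into its own subdirectory matches the three ProcessingOutput sources, so each one would be uploaded as a separate output of the job.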