@yudetamago
Created December 12, 2017 06:43
DataProcSparkOperator example
# airflow 1.9 required
from datetime import datetime
from airflow import DAG
from airflow.contrib.operators.dataproc_operator import DataProcSparkOperator
TASK_ID = 'dataproc_spark_submit'
MAIN_JAR = 'gs://your-bucket/path/test-job.jar'
JOB_NAME = 'hello'
CLUSTER_NAME = 'test-cluster'
with DAG('dataproc_spark_submit', schedule_interval='@once', start_date=datetime(2017, 12, 11)) as dag:
    dataproc_spark_submit = DataProcSparkOperator(
        task_id=TASK_ID,
        main_jar=MAIN_JAR,
        job_name=JOB_NAME,
        cluster_name=CLUSTER_NAME
    )
    dataproc_spark_submit
@shekarraj3

How does it determine the main class?
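
On the main class question: as far as I understand it, when only `main_jar` is given, Dataproc uses the entry point declared in the jar itself (the `Main-Class` entry of its manifest). The operator also accepts `main_class`, documented as an alternative to `main_jar` (use one or the other, not both), with the jar that holds the class passed through `dataproc_spark_jars`. Below is a minimal sketch of choosing between the two. The helper `spark_entry_point_kwargs` and the class name `com.example.Hello` are made up for illustration and are not part of Airflow or of the gist.

```python
def spark_entry_point_kwargs(main_jar=None, main_class=None, extra_jars=None):
    """Build the entry-point keyword arguments for DataProcSparkOperator.

    Hypothetical helper for illustration only; it is not part of Airflow.
    """
    # The operator documents main_jar and main_class as alternatives,
    # so require exactly one of them.
    if (main_jar is None) == (main_class is None):
        raise ValueError('set exactly one of main_jar or main_class')
    if main_jar is not None:
        # The entry point is taken from the jar itself.
        return {'main_jar': main_jar}
    # The entry point is named explicitly; the jar that contains the class
    # is supplied separately through dataproc_spark_jars.
    return {
        'main_class': main_class,
        'dataproc_spark_jars': list(extra_jars or []),
    }


# Variant of the gist's task that names the class explicitly.
# 'com.example.Hello' is an assumed class name.
kwargs = spark_entry_point_kwargs(
    main_class='com.example.Hello',
    extra_jars=['gs://your-bucket/path/test-job.jar'],
)
print(kwargs)
```

The resulting dictionary would then be passed to the operator in place of `main_jar=MAIN_JAR`, for example `DataProcSparkOperator(task_id=TASK_ID, job_name=JOB_NAME, cluster_name=CLUSTER_NAME, **kwargs)`.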
