Skip to content

Instantly share code, notes, and snippets.

@LorePep
LorePep / parallel_dag.py
Created September 20, 2018 16:48
A simple parallel dag with Airflow
import os
from datetime import datetime, timedelta
from airflow import DAG
from airflow.operators.bash_operator import BashOperator
from airflow.operators.python_operator import PythonOperator
DATA_FOLDER = "data"
@LorePep
LorePep / multiple_workers.py
Last active November 26, 2018 07:47
A simple multiworkers pipeline in Luigi
import luigi
NUMBER_OF_FILES = 50
class WritePipelineTask(luigi.Task):
flow_id = luigi.Parameter()
def output(self):
return luigi.LocalTarget("data/output_one_{}.txt".format(self.flow_id))
@LorePep
LorePep / multi_file_task.py
Last active July 17, 2020 07:48
A luigi pipeline with multiple files
import luigi
NUMBER_OF_FILES = 50
class WritePipelineTask(luigi.Task):
def output(self):
return luigi.LocalTarget("data/_DONE_WRITE_PIPELINE_TASK")
@LorePep
LorePep / simpe_dag.py
Created September 20, 2018 16:40
A simple workflow in Airflow
import os
from datetime import datetime, timedelta
from airflow import DAG
from airflow.operators.bash_operator import BashOperator
from airflow.operators.python_operator import PythonOperator
OUTPUT_ONE_PATH = "data/output_one.txt")
OUTPUT_TWO_PATH = "data/output_two.txt")
@LorePep
LorePep / luigi_simple.py
Last active November 26, 2018 07:46
Simple pipeline in luigi
import luigi
class WritePipelineTask(luigi.Task):
def output(self):
return luigi.LocalTarget("data/output_one.txt")
def run(self):
with self.output().open("w") as output_file: