- 243 PRs by 61 committers
- 437 messages on the mailing list
- 40 companies
- Upcoming unsolicited talk: "A Practical Introduction to Airflow" at PyBay by Matt Davis (Clover Health)
No wonder it's hard to keep up!
Author: jlowin 38
Author: Bolke de Bruin 28
Author: Chris Riccomini 21
Author: Dan Davydov 18
Author: Maxime Beauchemin 11
Author: Kengo Seki 11
Author: Siddharth Anand 11
Author: Stanislav Kudriashev 8
Author: Sid Anand 7
Author: Sumit Maheshwari 7
Author: Arthur Wiedmer 6
Author: Li Xuanji 5
Author: Alex Van Boxel 4
Author: Eric Stern 4
Author: Hongbo Zeng 3
Author: Hervé Werner 3
Author: Norman Mu 3
Author: Alexey Ustyantsev 3
Author: Ajay Yadav 2
Author: Rob Froetscher 2
Author: Ilya Rakoshes 2
Author: Tsuyoshi Ozawa 2
Author: John Bodley 2
Author: Joy Gao 2
Author: Junwei Wang 2
Author: Yap Sok Ann 2
Author: Ajay Yadava 2
- Complete scheduler rewrite, with amazing cross-company collaboration
- Scheduler is threaded and will scale to the next order of magnitude
- Scheduler is insulated: DAG code is parsed in subprocesses
- jlowin's Git integration tool makes working with Apache's constraints so much easier; less friction when merging is really important
- @mistercrunch is no longer a bottleneck for the project; many engineers from different companies know the core of the project
- More logging & stats collection in critical areas
- Refactor of the dependency engine (@aoen pending merge), along with a "Why isn't my task triggered" web page
- Workload management for Hive
- SparkSql & EMR operators
- Apache compliance (removing Highcharts, license headers)
- Tons of bug fixes, tweaks and polish
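
To illustrate the "insulated scheduler" item above, here is a minimal sketch of the general idea of parsing user code in a child process so that a broken or hanging file cannot take the parent down. This is not Airflow's actual implementation: the function name `parse_in_subprocess` and the `timeout` parameter are hypothetical, and the child simply executes the file with the current Python interpreter.

```python
import subprocess
import sys


def parse_in_subprocess(path, timeout=30):
    """Run a Python file in a child interpreter (hypothetical helper).

    Returns (ok, stderr_text). A crash, non-zero exit, or hang in the
    child is reported to the caller instead of affecting this process.
    """
    try:
        result = subprocess.run(
            [sys.executable, path],
            capture_output=True,
            text=True,
            timeout=timeout,
        )
    except subprocess.TimeoutExpired:
        # The child is killed by subprocess.run when the timeout expires.
        return False, "timed out"
    return result.returncode == 0, result.stderr
```

A caller could loop over DAG files, call this helper on each, and log the returned error text for any file that fails, while continuing with the rest.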