Skip to content

Instantly share code, notes, and snippets.

@tobowers
Created May 11, 2015 14:15
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save tobowers/394370c13c0beafe2dd5 to your computer and use it in GitHub Desktop.
Save tobowers/394370c13c0beafe2dd5 to your computer and use it in GitHub Desktop.
Talk #1: ETL Architecture For Experimentation at Flatiron Health
Gil will share some of the principles and techniques used by Flatiron to build the nation’s largest cancer database. Flatiron Health’s platform connects cancer centers across the country on a common technology infrastructure, supporting treatment for about 1 in 5 active cancer patients in the U.S. Their architecture is focused on enabling rapid development iterations with cancer specialists, nurses, and other stakeholders. Rapidly integrating medical data from hundreds of heterogeneous source systems required Flatiron to develop a unique data integration and processing framework. Iterating quickly while incorporating knowledge from oncologists, nurses, medical informaticists, and healthcare analysts has largely shaped this architecture as well.
The talk will focus on tooling and the architecture that we built to support rapid iteration on complex ETL processes (iteration itself is quick) and minimizes knowledge translation iterations (number of iterations is smaller - we get stuff right with less iterations). The lessons taught here are on building architecture that optimally supports a business process or a unique development process and less about CS concepts like scalability, reliability etc.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment