Skip to content

Instantly share code, notes, and snippets.

View madgpap's full-sized avatar

George Papadatos madgpap

  • EMBL-EBI
  • Cambridge
View GitHub Profile
@madgpap
madgpap / desc.txt
Last active August 29, 2015 14:19 — forked from mnowotka/desc.txt
We describe how Python is leveraged to streamline the modelling of drug discovery data and the development of tools for the scientific community. We look at various examples, e.g. chemistry toolkits, machine-learning applications and web frameworks and show how Python can glue it all together to create efficient data science pipelines.

ChEMBL is the largest open access database resource in the fields of computational drug discovery, chemoinformatics and chemical biology. Contrary to the common Perl-related perception, the Python programming language is used predominantly in the aforementioned fields. In this presentation, we describe how Python is used as the cornerstone and foundation inside and outside the ChEMBL group, in order to support and streamline many facets of our work, tools and resources. In particular, we cover the following topics: