@andreajparker
Created January 14, 2020 16:36
Curated list of DS libraries

Phase: Data

Data Annotation

Datasets

Importing Data

Data Augmentation

Phase: Exploration

Data Preparation

Notebook Exploration

  • View Jupyter notebooks through CLI: nbdime
  • Parametrize notebooks: papermill
  • Access notebooks programmatically: nbformat
  • Convert notebooks to other formats: nbconvert
  • Extra utilities not present in frameworks: mlxtend
  • Maps in notebooks: ipyleaflet
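
To give a feel for what programmatic notebook access involves, here is a minimal sketch that reads a notebook using only the standard-library json module. It does not use nbformat itself; it only relies on the fact that a version 4 notebook file is JSON with a top-level "cells" list. The names NOTEBOOK_JSON and cell_sources, and the notebook content, are hypothetical and exist only for this illustration.

```python
import json

# A tiny notebook in the version 4 JSON layout (hypothetical content).
NOTEBOOK_JSON = """
{
  "nbformat": 4,
  "nbformat_minor": 5,
  "metadata": {},
  "cells": [
    {"cell_type": "markdown", "metadata": {},
     "source": ["# Title\\n", "Some notes."]},
    {"cell_type": "code", "metadata": {}, "execution_count": null,
     "outputs": [], "source": "x = 1\\nprint(x)"}
  ]
}
"""


def cell_sources(notebook, cell_type="code"):
    """Return the source text of every cell of the given type."""
    sources = []
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != cell_type:
            continue
        source = cell.get("source", "")
        # "source" may be stored as one string or as a list of lines.
        if isinstance(source, list):
            source = "".join(source)
        sources.append(source)
    return sources


notebook = json.loads(NOTEBOOK_JSON)
code_cells = cell_sources(notebook, "code")
markdown_cells = cell_sources(notebook, "markdown")
print(code_cells)
```

For real work, a dedicated library such as nbformat is likely the better choice, since it also validates the notebook against the format schema and handles older format versions.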

Phase: Feature Engineering

Feature Generation

Phase: Modeling

Model Selection

NLP

Speech Recognition

RecSys

  • Factorization machines (FM), and field-aware factorization machines (FFM): xlearn
  • Scikit-learn-like API: surprise
  • Recommendation System in Pytorch: CaseRecommender

Computer Vision

Timeseries

Framework extensions

Phase: Monitoring

Model Training Monitoring

Phase: Optimization

Hyperparameter Optimization

Interpretability

Visualization

Phase: Production

Model Serialization

Scalability

Benchmark

API

Dashboard

Adversarial testing

Python libraries
