Personal list of cool Python related projects to play with. Projects are counted as long as they have a Python API regardless if the underlying code is written in another language.
- PySpark
- GraphLab Create
- Scikit Learn
- Tensorflow
- Cloud Dataflow
- Pandas
- Caffe
- MXNet
- gensim (can use word2vec without the need to do stemming, remove stop words etc.)
- Scikit-image for feature extractions
- SimpleCV
- XGBoost - for Gradient Boosting trees
- Statsmodel
- Pymc - MCMC with hierarchical models and uses graphical models
- emcee -
- GPy - Gaussian Process
- Spearmint - also check out the NIPS paper reference at the repo
- IPython notebook
- Matplotlib
- Seaborn
- Plotly - can be interactive
- Bokeh - interactive with D3.js backend
- H5py
- Feather - for exchanging dataframes between R and Python
- Parquet (via PySpark / Py4j currently)
- Dask
- Joblib
- PySpark
- MPI4Py
- Numba
- Pypy
- Theano