
@arpitHub
Last active July 26, 2018 14:41
Useful sources for Machine Learning

Datasets for projects

https://www.dataquest.io/blog/free-datasets-for-projects/

Comparison of Supervised Learning Algorithms

http://www.dataschool.io/comparing-supervised-learning-algorithms/

Things to know about Machine Learning

https://homes.cs.washington.edu/~pedrod/papers/cacm12.pdf

Which is worse: False Positive or False Negative?

https://www.quora.com/Whats-worse-a-false-positive-or-false-negative-in-machine-learning

Dealing with Categorical Variables for Predictive Modeling

https://www.analyticsvidhya.com/blog/2015/11/easy-methods-deal-categorical-variables-predictive-modeling/
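One common way to handle a categorical variable is one-hot encoding. The snippet below is a minimal sketch using only the standard library; the helper name `one_hot` and the sample colors are assumptions made for illustration and are not taken from the linked article.

```python
def one_hot(values):
    """Encode a list of category labels as 0/1 indicator rows.

    Hypothetical helper for illustration only.
    Returns the sorted category names and one indicator row per input value.
    """
    categories = sorted(set(values))
    encoded = [[1 if v == c else 0 for c in categories] for v in values]
    return categories, encoded


# Illustrative data, not from the source.
categories, encoded = one_hot(["red", "green", "red"])
```

Each column corresponds to one category, so a model that expects numeric input can consume the rows directly. In practice a library encoder is usually preferable for large or high-cardinality columns.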

Handling Multiclass Features for Predictive Modeling

https://www.quora.com/How-do-I-use-machine-learning-with-classification-data-that-has-multiple-values-per-variable#

Parametric vs. Non-parametric Models

https://www.quora.com/What-is-the-difference-between-a-parametric-model-and-a-non-parametric-model

Oversampling and Undersampling (to fix class imbalance in a dataset)

https://en.wikipedia.org/wiki/Oversampling_and_undersampling_in_data_analysis
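As a rough illustration of the idea, the sketch below shows plain random oversampling and undersampling with the standard library. The function names, the `seed` parameter, and the toy data are assumptions for this example; the linked page also covers more refined methods that are not shown here.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate random minority-class rows until every class matches the largest one."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


def random_undersample(rows, labels, seed=0):
    """Keep a random subset of each class, sized to match the smallest one."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = min(counts.values())
    out_rows, out_labels = [], []
    for cls in counts:
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for r in rng.sample(pool, target):
            out_rows.append(r)
            out_labels.append(cls)
    return out_rows, out_labels


# Toy imbalanced data (illustrative only): five "a" rows, two "b" rows.
rows = list(range(7))
labels = ["a"] * 5 + ["b"] * 2
over_rows, over_labels = random_oversample(rows, labels)
under_rows, under_labels = random_undersample(rows, labels)
```

Resampling should normally be applied to the training split only, so that evaluation data keeps its original class distribution.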

Choosing the right ML algorithm

https://www.quora.com/How-do-we-decide-which-algorithm-to-use-in-machine-learning

Neural Network (Deep Learning)

https://www.youtube.com/watch?v=aircAruvnKk&list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi

Pandas comparison with SQL

https://pandas.pydata.org/pandas-docs/stable/comparison_with_sql.html

Bimodal Histogram

https://math.stackexchange.com/questions/1681699/is-this-histogram-considered-bimodal

CNN explained

https://www.youtube.com/watch?v=FmpDIaiMIeA

Hackathon Toolbox

https://www.thecodeship.com/general/hackathon-toolbox-essential-tools-practices/

Cheatsheets

  • Pandas Cheat-sheet

https://github.com/pandas-dev/pandas/blob/master/doc/cheatsheet/Pandas_Cheat_Sheet.pdf

  • NumPy Cheat-sheet

https://s3.amazonaws.com/assets.datacamp.com/blog_assets/Numpy_Python_Cheat_Sheet.pdf

  • Scikit-learn Cheat-sheet

https://s3.amazonaws.com/assets.datacamp.com/blog_assets/Scikit_Learn_Cheat_Sheet_Python.pdf
