
@nchelaru
Created January 14, 2020 18:44

Multicollinearity in regression must be addressed: redundant predictor variables should be removed until the multicollinearity is gone.
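One common way to carry out this kind of removal is to drop the predictor with the highest variance inflation factor (VIF) repeatedly until every VIF falls below a cutoff. The following is a minimal sketch of that idea, not a method prescribed by this note: the helper names `vif` and `drop_collinear`, the cutoff of 10, and the synthetic data are all assumptions made for illustration, and the code assumes no predictor column is constant.

```python
import numpy as np

def vif(X):
    """Variance inflation factor of each column of X (n_samples x n_features)."""
    X = np.asarray(X, dtype=float)
    out = []
    for j in range(X.shape[1]):
        y = X[:, j]
        others = np.delete(X, j, axis=1)
        # Regress column j on the remaining columns plus an intercept.
        A = np.column_stack([np.ones(len(y)), others])
        coef, *_ = np.linalg.lstsq(A, y, rcond=None)
        resid = y - A @ coef
        r2 = 1.0 - (resid @ resid) / ((y - y.mean()) ** 2).sum()
        out.append(np.inf if r2 >= 1.0 else 1.0 / (1.0 - r2))
    return np.array(out)

def drop_collinear(X, names, threshold=10.0):
    """Drop the highest-VIF column until all VIFs are <= threshold (assumed cutoff)."""
    X = np.asarray(X, dtype=float)
    names = list(names)
    while X.shape[1] > 1:
        v = vif(X)
        worst = int(np.argmax(v))
        if v[worst] <= threshold:
            break
        X = np.delete(X, worst, axis=1)
        names.pop(worst)
    return X, names

# Synthetic example: "c" is almost an exact multiple of "a".
rng = np.random.default_rng(0)
a = rng.normal(size=200)
b = rng.normal(size=200)
c = 2.0 * a + 0.01 * rng.normal(size=200)
X_kept, kept = drop_collinear(np.column_stack([a, b, c]), ["a", "b", "c"])
print(kept)
```

Which of the two near-duplicate columns is dropped depends on the data; in practice that choice is often made on subject-matter grounds rather than by VIF alone.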

Multicollinearity is less of a problem for trees, clustering and nearest-neighbours methods. In these methods, it may be advisable to retain all p dummy variables for a categorical predictor with p levels, rather than the p - 1 typically used in regression. However, even in these methods, non-redundancy in predictor variables is still desired.
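The difference between keeping p and p - 1 dummy variables can be sketched with a small example. The `colors` data below is made up for illustration. With an intercept, the p dummy columns sum to the intercept column, so the regression design matrix loses rank. Dropping one level removes that redundancy, while methods that do not fit an intercept-based linear model can keep all p columns so that every level is represented by its own column.

```python
import numpy as np

# Hypothetical categorical predictor with p = 3 levels.
colors = np.array(["red", "green", "blue", "green", "red", "blue"])
levels, codes = np.unique(colors, return_inverse=True)
p = len(levels)

full = np.eye(p)[codes]   # all p dummy columns (one-hot encoding)
reduced = full[:, 1:]     # p - 1 columns; the first level becomes the baseline

# In a regression with an intercept, the full encoding is rank-deficient.
intercept = np.ones((len(colors), 1))
rank_full = np.linalg.matrix_rank(np.hstack([intercept, full]))
rank_reduced = np.linalg.matrix_rank(np.hstack([intercept, reduced]))

print(full.shape[1] + 1, rank_full)        # column count vs. rank, full encoding
print(reduced.shape[1] + 1, rank_reduced)  # column count vs. rank, reduced encoding
```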
