@jbarnoud
Last active September 28, 2015 09:48
Here is a notebook that explores K-means in the context of PBxplore.
@HubLot

HubLot commented Sep 28, 2015

Great job Jonathan!
RMSD could be a nice measure for comparing the different clusters. The issue with a regular MD (the 270 sequences you tested) is knowing the right number of clusters. Maybe '4' is not a good choice, which would make the reproducibility hard to assess.
The issue with built-in k-means implementations, I think, is that it is really difficult to plug in a custom distance metric and a custom representation of the centroids.
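
To make the last point concrete, here is a minimal sketch of a k-means loop that takes the distance and the centroid construction as plain functions. Everything in it is an assumption for illustration: the names `kmeans`, `hamming` and `consensus` are hypothetical, the Hamming distance and per-position consensus are stand-ins rather than what PBxplore or the notebook uses, and the toy sequences are made up.

```python
import random
from collections import Counter


def hamming(seq, centroid):
    """Count the positions at which two equal-length sequences differ."""
    return sum(a != b for a, b in zip(seq, centroid))


def consensus(seqs):
    """Build a centroid from the most frequent letter at each position."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*seqs))


def kmeans(items, k, distance, make_centroid, max_iter=100, seed=0):
    """Lloyd-style k-means with pluggable distance and centroid functions."""
    rng = random.Random(seed)
    centroids = rng.sample(items, k)  # start from k randomly chosen items
    labels = None
    for _ in range(max_iter):
        # Assignment step: each item goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: distance(item, centroids[c]))
            for item in items
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centroids are too
        labels = new_labels
        # Update step: rebuild each centroid from its members.
        for c in range(k):
            members = [item for item, lab in zip(items, labels) if lab == c]
            if members:  # keep the previous centroid if a cluster empties
                centroids[c] = make_centroid(members)
    return labels, centroids


# Made-up toy sequences, not real PB data.
sequences = ["aaaa", "aaab", "aaba", "dddd", "dddc", "ddcd"]
labels, centroids = kmeans(sequences, 2, hamming, consensus)
print(labels, centroids)
```

Swapping in another distance or another centroid representation only means passing different functions to `kmeans`, which is the flexibility that is hard to get from a built-in implementation.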
