@csetzkorn
Created September 23, 2017 19:27
Hierarchical Clustering Iris
library(dplyr)
library(ggplot2)

# Read the tab-separated Iris data from the working directory
setwd('D:\\ToyData')
OrginalData <- read.table("IrisData.txt",
                          header = TRUE, sep = "\t")
# Cluster on the petal measurements only; uncomment the sepal columns to include them
SubsetData <- subset(OrginalData, select = c(
  # "SepalLength",
  # "SepalWidth",
  "PetalLength",
  "PetalWidth"
))
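# Average-linkage agglomerative clustering on Euclidean distances (the dist() default)
clusters <- hclust(dist(SubsetData), method = 'average')
plot(clusters)  # dendrogram

# Cut the tree into 3 clusters and cross-tabulate them against the true species
clusterCut <- cutree(clusters, 3)
table(clusterCut, OrginalData$Species)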
# Large translucent points show the true species; the smaller points on top show the assigned cluster
ggplot(OrginalData, aes(PetalLength, PetalWidth, color = Species)) +
  geom_point(alpha = 0.4, size = 3.5) +
  geom_point(col = clusterCut) +
  scale_color_manual(values = c('black', 'red', 'green'))