Skip to content

Instantly share code, notes, and snippets.

@andychase
Last active April 5, 2017 21:26
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save andychase/8d64812b75fa1489eccc34edd9cf6a73 to your computer and use it in GitHub Desktop.
Save andychase/8d64812b75fa1489eccc34edd9cf6a73 to your computer and use it in GitHub Desktop.
cs_476_data_script.r
library(readr)
data <- read_csv("data.csv",
col_types = cols(date = col_datetime(format = "%FT%T")))
# costs <- read_csv("Cloud Provider Costs - Sheet1.csv")
normz <- function(column) { ( column - mean(column) ) / sd(column) }
# data <- merge(data, costs)
data[,c("nginx")] <- normz(getElement(data,"nginx"))
data[,c("sha256sum")] <- normz(getElement(data,"sha256sum"))
data[,c("sysbench/memory")] <- normz(getElement(data,"sysbench/memory"))
data[,c("sysbench/cpu")] <- normz(getElement(data,"sysbench/cpu"))
data[,c("sorting")] <- normz(getElement(data,"sorting"))
data[,c("ffmpeg")] <- normz(getElement(data,"ffmpeg"))
data$hour = as.double(format(data$date, "%H"))
data$weekday = format(data$date, "%A")
data$total = rowMeans(data[,3:8])
aggregate(data, by=list(name=data$name), FUN=mean)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment