@csgallego
Created March 18, 2014 03:58
# Cleaning the dates of the White House visitor logs
# (assumes the logs are already loaded into a data frame named visits)
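# First pass: parse assuming a four-digit year. A two-digit year such as "13"
# is read as the year 13, so z$year (years since 1900) is negative for it.
z = strptime(visits$APPT_START_DATE,format="%m/%d/%Y %H:%M")
class(z)  # POSIXlt
head(visits$APPT_START_DATE)
# List the raw strings whose year was misread
visits$APPT_START_DATE[z$year < 0]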
# Preview the fix: expand "/13 " (two-digit year followed by a time) to "/2013 "
head(gsub("/13 ","/2013 ",visits$APPT_START_DATE))
# Keep the raw column untouched; store the cleaned strings in a new column and re-parse
visits$CAPPT_START_DATE = gsub("/13 ","/2013 ",visits$APPT_START_DATE)
z = strptime(visits$CAPPT_START_DATE,format="%m/%d/%Y %H:%M")
# Rows that still fail to parse: flag them, count them, then inspect them
is.na(z)
sum(is.na(z))
visits$CAPPT_START_DATE[is.na(z)]
# Expand a two-digit year at the very end of the string (date with no time)
visits$CAPPT_START_DATE = gsub("/13$","/2013",visits$CAPPT_START_DATE)
# Fill the remaining gaps by parsing those rows as date-only values
z[is.na(z)] = strptime(visits$CAPPT_START_DATE[is.na(z)],format="%m/%d/%Y")
# Attach the parsed times to the data frame and check the span they cover
visits$time = z
range(z)
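# Cutoff: midnight at the start of 12/31/2012
tick = strptime("12/31/2012 00:00",format="%m/%d/%Y %H:%M")
# Inspect the first five columns of the visits dated before the cutoff
visits[visits$time < tick,1:5]
# Keep only the visits dated after the cutoff
visits = visits[visits$time > tick,]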
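# Visits per calendar month ($mon is 0-based, hence the +1)
barplot(table(visits$time$mon+1))
# Visits by day of the year ($yday is 0-based)
hist(visits$time$yday,breaks=1000)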
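
# A minimal sketch of the same cleaning steps wrapped in one function and run
# on made-up sample strings. clean_dates and demo are hypothetical names, not
# part of the original script. Unlike the hard-coded "/13" fix above, this
# variant assumes every two-digit year means 20xx, and it parses in UTC.
clean_dates = function(x) {
  x = gsub("/([0-9]{2}) ","/20\\1 ",x)   # two-digit year followed by a time
  x = gsub("/([0-9]{2})$","/20\\1",x)    # two-digit year at the end, no time
  out = strptime(x,format="%m/%d/%Y %H:%M",tz="UTC")
  bad = is.na(out)                       # rows with no time component
  out[bad] = strptime(x[bad],format="%m/%d/%Y",tz="UTC")
  out
}
demo = clean_dates(c("1/2/2013 10:00","1/3/13 11:30","1/4/13"))
demo$year + 1900                         # calendar year of each parsed value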