Ben Sully sd2k

  • Grafana Labs
  • Sheffield

Keybase proof

I hereby claim:

  • I am sd2k on github.
  • I am bsull (https://keybase.io/bsull) on keybase.
  • I have a public key ASDRT3yQq5G5H2ezmV5LZCAw3TCyxQ2DWWieBqswPy4dAwo

To claim this, I am signing this object:

sd2k / sparkr-dplyr
Created June 21, 2015 00:08
Methods to integrate SparkR & dplyr, and in doing so allow non-standard evaluation (e.g. select(df, age) instead of select(df, df$age))
#' dplyr compatibility methods.
#'
#' These allow Spark DataFrames to cooperate with standard dplyr verbs in a
#' more-or-less ordinary way. This also means we can chain commands together
#' using pipes (`%>%`) when selecting, filtering, adding to or summarising a
#' DataFrame.
#'
#' Currently requires dplyr to be attached after SparkR: SparkR defines its own
#' versions of many of the dplyr verbs, and whichever package is attached last
#' masks the other.
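#'
#' A minimal usage sketch, assuming this file has been sourced and that `df`
#' is a Spark DataFrame with an `age` column (both `df` and `age` are
#' hypothetical names used only for illustration):
#'
#'     library(SparkR)
#'     library(dplyr)        # attached second, so dplyr's verbs mask SparkR's
#'     df %>% select(age)    # rather than select(df, df$age)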