Skip to content

Instantly share code, notes, and snippets.

@svenkreiss
Last active August 29, 2015 14:22
Show Gist options
  • Save svenkreiss/46bdce005bb15081a751 to your computer and use it in GitHub Desktop.
Save svenkreiss/46bdce005bb15081a751 to your computer and use it in GitHub Desktop.
pysparkling example: word count
from pysparkling import Context
counts = Context().textFile(
'README.rst'
).flatMap(
lambda line: line.split(' ')
).map(
lambda word: (word, 1)
).reduceByKey(
lambda a, b: a + b
)
print(counts.collect())
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment