Skip to content

Instantly share code, notes, and snippets.

@muhammetfaik
Created July 26, 2017 03:19
Show Gist options
  • Save muhammetfaik/eb26156eaabfef0c7e7d5619f90a4969 to your computer and use it in GitHub Desktop.
Save muhammetfaik/eb26156eaabfef0c7e7d5619f90a4969 to your computer and use it in GitHub Desktop.
fsdf
text_file = spark.textFile("hdfs://...")
text_file.flatMap(lambda line: line.split())
.map(lambda word: (word, 1))
.reduceByKey(lambda a, b: a+b)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment