Derek Peterson derekpeterson

derekpeterson / CityReviews.py
Last active December 14, 2015 04:49
Simple mrjob script that counts words in a TSV file whose lines have the form "category\t[item1,item2,item3]".
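To make the input format concrete, here is a minimal plain-Python sketch of parsing and counting lines of that form, without mrjob. The helper names `parse_line` and `count_words`, the sample data, and the choice to key counts by (category, word) are all assumptions for illustration, not necessarily what the script itself does.

```python
import re
from collections import Counter


def parse_line(line):
    # Split one "category<TAB>[item1,item2,item3]" line into (category, items).
    category, raw_items = line.rstrip("\n").split("\t", 1)
    # Drop the square brackets and any spaces, then split on commas.
    cleaned = re.sub(r"\[|\]| ", "", raw_items)
    items = [item for item in cleaned.split(",") if item]
    return category, items


def count_words(lines):
    # Assumption: counts are keyed by (category, word) pairs.
    counts = Counter()
    for line in lines:
        category, items = parse_line(line)
        for item in items:
            counts[(category, item)] += 1
    return counts


# Hypothetical sample input, two lines of the TSV.
sample = ["Seattle\t[coffee, rain, coffee]", "Portland\t[coffee]"]
counts = count_words(sample)
```

In an mrjob job the same split would typically happen in the mapper, with the summing left to a reducer.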
#!/usr/bin/env python
from mrjob.job import MRJob
import json
import re


class CityReviews(MRJob):
    def mapper(self, _, line):
        line = re.sub(r'\[|\]| ', '', line)