Skip to content

Instantly share code, notes, and snippets.

@aneesha
Created June 6, 2018 04:49
Show Gist options
  • Save aneesha/7b53a89025bfc05a1c8ced0b73d557f8 to your computer and use it in GitHub Desktop.
Save aneesha/7b53a89025bfc05a1c8ced0b73d557f8 to your computer and use it in GitHub Desktop.
import dask.bag as db
import json
records = db.read_text('data/2018-*-*.json').map(json.loads)
records.filter(lambda d: d['username'] == 'Aneesha').pluck('id').frequencies()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment