Tim Thompson timathom
timathom / pymarc_disco.py
Created September 21, 2017 15:28 — forked from anarchivist/pymarc_disco.py
sample MapReduce tasks for Disco to get tag counts from MARC files/streams
#!/usr/bin/env python
#
# pymarc_disco.py - Mark Matienzo
# sample MapReduce tasks for Disco to get tag counts from MARC files/streams
# usage: python pymarc_disco.py <input1> [input2 ... inputN]
import sys
from disco.core import Disco, result_iterator
from disco.settings import DiscoSettings
import pymarc