Skip to content

Instantly share code, notes, and snippets.

@mjpost
Created July 31, 2020 18:47
Show Gist options
  • Save mjpost/4c54446b7030d7c64b57461d27090650 to your computer and use it in GitHub Desktop.
Save mjpost/4c54446b7030d7c64b57461d27090650 to your computer and use it in GitHub Desktop.
#!/usr/bin/env python3
import sys
from sacremoses.normalize import MosesPunctNormalizer
def main(args):
normalizer = MosesPunctNormalizer(lang=args.lang, penn=args.penn)
for line in sys.stdin:
print(normalizer.normalize(line.rstrip()), flush=True)
if __name__ == '__main__':
import argparse
parser = argparse.ArgumentParser()
parser.add_argument('--lang', '-l', default='en')
parser.add_argument('--penn', '-p', action='store_true')
args = parser.parse_args()
main(args)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment