nat1881 / gsoc_cltk_core_2017_summary.md
Last active August 29, 2017 13:50
GSoC 2017 Summary

Natasha Voake

Classical Language Toolkit (CLTK) - Google Summer of Code (GSoC) 2017

  1. The goal of my GSoC project was to extend CLTK coverage to Old and Middle French. To this end, I compiled and developed the following:
  • a corpus of Old French (OF) and Middle French (MF) texts, with a particular focus on Anglo-Norman, available at https://github.com/cltk/french_text

  • word and line tokenizers for OF and MF.
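
To illustrate what word and line tokenization involve for these languages, here is a minimal sketch in plain Python. It is not the CLTK implementation: the function names `tokenize_lines` and `tokenize_words` and the list of elided clitics are assumptions made for this example only.

```python
import re

# Assumed (hypothetical) set of clitics that elide before a vowel,
# e.g. "l'espee", "d'amor", "qu'il". Not taken from CLTK.
TOKEN_RE = re.compile(
    r"\b(?:qu|[cdjlmnst])['’]"   # elided clitic, kept with its apostrophe
    r"|\w+(?:['’]\w+)*"          # ordinary word (internal apostrophes kept)
    r"|[^\w\s]",                 # any single punctuation mark
    re.IGNORECASE,
)


def tokenize_lines(text):
    """Split a text into non-empty lines (e.g. lines of verse)."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def tokenize_words(line):
    """Split one line into words, elided clitics and punctuation marks."""
    return TOKEN_RE.findall(line)


if __name__ == "__main__":
    sample = "Carles li reis, nostre emperere magnes\n\nQu'il n'i ad castel\n"
    for verse in tokenize_lines(sample):
        print(tokenize_words(verse))
```

The elided clitic is kept as a separate token ending in its apostrophe, so that "Qu'il" yields two tokens; a real tokenizer would need a fuller, language-specific list of such forms.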