@EugW
Last active September 29, 2019 09:22
Language to TrainedData for Tesseract OCR
// Maps a display language name to its Tesseract trained-data file name.
val langData = mapOf(
    "English" to "eng.traineddata",
    "Spanish" to "spa.traineddata",
    "Italian" to "ita.traineddata",
    "Russian" to "rus.traineddata",
    "Polish" to "pol.traineddata",
    "French" to "fra.traineddata",
    "German" to "deu.traineddata",
    "Portuguese" to "por.traineddata",
    "Japanese" to "jpn.traineddata",
    "Korean" to "kor.traineddata",
    "Chinese - Traditional" to "chi_tra.traineddata",
    "Chinese - Simplified" to "chi_sim.traineddata"
)
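
A minimal sketch of how a map like this might be consulted. The helper names `trainedDataFileFor` and `languageCodeFor` are hypothetical and not part of the original gist. The second helper assumes the language code Tesseract expects (for example, the value given to the command-line `-l` option) is the file name without its `.traineddata` suffix.

```kotlin
// Hypothetical helper (not in the original gist): returns the trained-data
// file name for a display language, or null if the language is not in the table.
fun trainedDataFileFor(language: String, table: Map<String, String>): String? =
    table[language]

// Hypothetical helper (not in the original gist): derives the language code
// by stripping the ".traineddata" suffix from the file name.
fun languageCodeFor(language: String, table: Map<String, String>): String? =
    table[language]?.removeSuffix(".traineddata")

fun main() {
    // A two-entry sample in the same shape as langData above.
    val sample = mapOf(
        "English" to "eng.traineddata",
        "Chinese - Simplified" to "chi_sim.traineddata"
    )
    println(languageCodeFor("English", sample))     // prints "eng"
    println(trainedDataFileFor("Klingon", sample))  // prints "null"
}
```

Returning a nullable `String?` leaves it to the caller to decide what an unsupported language should mean, such as falling back to English or showing an error.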