Skip to content

Instantly share code, notes, and snippets.

@graydon
Forked from Manishearth/devanagari.txt
Created April 6, 2019 05:28
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save graydon/3c8925dda6215a291136fe30342418a8 to your computer and use it in GitHub Desktop.
Save graydon/3c8925dda6215a291136fe30342418a8 to your computer and use it in GitHub Desktop.
devanagari breakdown
common stuff
-----------
Basic consonants(32):
कखगघङचछजझटठडढणतथदधनपफबभमयरलवशषसह
Weirdo that only is used in ligatures, but necessary(1)
basic standalone vowels (11):
अआइईउऊऋएऐओऔ
basic vowels (12)
ि
virama(1)
nukta(1) This is usually not present in NFC form, you can get rid of it if you want
weirdo vowel that means different things based on language(1)
[59]
Language-specific
---------------
Hindi:
nukta'd consonants for hindi/urdu-only (7)
can be represented as consonant + nukta, but NFC to these, so the consonant + nukta form is hard to see
क़ख़ग़ज़ड़ढ़फ़
I think you can get rid of the nukta and ण, ञ, or ढ़ and *most* words will be fine
Marathi (1):
Kashmiri (11):
ऎऒऄॳॴॶॷ
sanskrit (7):
ऌॠॡ
Sindhi (4):
ॻॼॾॿ
marwari(1):
Transcribing other languages
------
marathi only, for transcribing other languages (4):
ॲऑऍ
Consonants only used for transcribing other languages (5):
ऩऱऴय़ॹ
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment