Last active
December 18, 2024 14:06
-
-
Save wanchichen/de008ed6fafca89881a9d0d79b23e58f to your computer and use it in GitHub Desktop.
ML-SUPERB 1.0 Languages
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| [abk] Abkhazian | |
| [afr] Afrikaans | |
| [amh] Amharic | |
| [ara] Arabic | |
| [asm] Assamese | |
| [ast] Asturian | |
| [aze] Azerbaijani | |
| [azz] Highland Puebla Nahuatl | |
| [bak] Bashkir | |
| [bas] Basa (Cameroon) | |
| [bel] Belarusian | |
| [ben] Bengali | |
| [bos] Bosnian | |
| [bre] Breton | |
| [bul] Bulgarian | |
| [cat] Catalan | |
| [ceb] Cebuano | |
| [ces] Czech | |
| [chv] Chuvash | |
| [ckb] Central Kurdish | |
| [cmn] Mandarin Chinese | |
| [cnh] Hakha Chin | |
| [cym] Welsh | |
| [dan] Danish | |
| [deu] German | |
| [div] Dhivehi | |
| [ell] Modern Greek (1453-) | |
| [eng] English | |
| [epo] Esperanto | |
| [est] Estonian | |
| [eus] Basque | |
| [fas] Persian | |
| [fil] Filipino | |
| [fin] Finnish | |
| [fra] French | |
| [frr] Northern Frisian | |
| [ful] Fulah | |
| [gle] Irish | |
| [glg] Galician | |
| [grn] Guarani | |
| [guj] Gujarati | |
| [hau] Hausa | |
| [heb] Hebrew | |
| [hin] Hindi | |
| [hrv] Croatian | |
| [hsb] Upper Sorbian | |
| [hun] Hungarian | |
| [hye] Armenian | |
| [ibo] Igbo | |
| [ina] Interlingua (International Auxiliary Language Association) | |
| [ind] Indonesian | |
| [isl] Icelandic | |
| [ita] Italian | |
| [jav] Javanese | |
| [jpn] Japanese | |
| [kab] Kabyle | |
| [kam] Kamba (Kenya) | |
| [kan] Kannada | |
| [kat] Georgian | |
| [kaz] Kazakh | |
| [kea] Kabuverdianu | |
| [khm] Khmer | |
| [kin] Kinyarwanda | |
| [kir] Kirghiz | |
| [kmr] Northern Kurdish | |
| [kor] Korean | |
| [lao] Lao | |
| [lav] Latvian | |
| [lga] Lungga | |
| [lin] Lingala | |
| [lit] Lithuanian | |
| [ltz] Luxembourgish | |
| [lug] Ganda | |
| [luo] Luo (Kenya and Tanzania) | |
| [mal] Malayalam | |
| [mar] Marathi | |
| [mhr] Eastern Mari | |
| [mkd] Macedonian | |
| [mlt] Maltese | |
| [mon] Mongolian | |
| [mri] Maori | |
| [mrj] Western Mari | |
| [msa] Malay (macrolanguage) | |
| [mya] Burmese | |
| [myv] Erzya | |
| [nan] Min Nan Chinese | |
| [nbl] South Ndebele | |
| [nep] Nepali (macrolanguage) | |
| [nld] Dutch | |
| [nso] Pedi | |
| [nya] Nyanja | |
| [oci] Occitan (post 1500) | |
| [ori] Oriya / Odia | |
| [orm] Oromo | |
| [pan] Panjabi | |
| [pol] Polish | |
| [por] Portuguese | |
| [pus] Pushto | |
| [ron] Romanian | |
| [rus] Russian | |
| [sah] Yakut | |
| [sin] Sinhala | |
| [skr] Saraiki | |
| [slk] Slovak | |
| [slv] Slovenian | |
| [sna] Shona | |
| [snd] Sindhi | |
| [som] Somali | |
| [sot] Southern Sotho | |
| [spa] Spanish | |
| [srp] Serbian | |
| [ssw] Swati | |
| [sun] Sundanese | |
| [swa] Swahili (macrolanguage) | |
| [swe] Swedish | |
| [tam] Tamil | |
| [tat] Tatar | |
| [tel] Telugu | |
| [tgk] Tajik | |
| [tha] Thai | |
| [tok] Toki Pona | |
| [tos] Highland Totonac | |
| [tsn] Tswana | |
| [tso] Tsonga | |
| [tur] Turkish | |
| [uig] Uighur | |
| [ukr] Ukrainian | |
| [umb] Umbundu | |
| [urd] Urdu | |
| [uzb] Uzbek | |
| [ven] Venda | |
| [vie] Vietnamese | |
| [wol] Wolof | |
| [xho] Xhosa | |
| [xty] Yoloxochitl Mixtec | |
| [yor] Yoruba | |
| [yue] Yue Chinese | |
| [zul] Zulu |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment