Last active
January 4, 2022 17:48
-
-
Save increpare/ba72c75aa19af4899a9a6975b9231d76 to your computer and use it in GitHub Desktop.
toki pona language statistics
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
some toki pona stats based on pu | |
letter frequency: | |
i 967 | |
a 953 | |
l 706 | |
n 614 | |
o 560 | |
e 545 | |
m 388 | |
s 301 | |
p 272 | |
k 255 | |
t 252 | |
u 219 | |
w 187 | |
j 153 | |
character frequency: | |
i 967 | |
a 953 | |
l 706 | |
n 614 | |
o 560 | |
e 545 | |
m 388 | |
s 301 | |
p 272 | |
. 265 | |
k 255 | |
t 252 | |
u 219 | |
w 187 | |
j 153 | |
? 46 | |
, 26 | |
! 25 | |
: 10 | |
" 8 | |
most common letter pairs: | |
li 446 | |
na 263 | |
on 206 | |
an 146 | |
mi 131 | |
in 127 | |
po 115 | |
si 113 | |
ma 102 | |
wa 101 | |
la 100 | |
il 100 | |
te 96 | |
lo 95 | |
al 93 | |
ok 79 | |
aw 78 | |
el 78 | |
to 77 | |
so 76 | |
ka 74 | |
ja 70 | |
ta 69 | |
ki 66 | |
word frequency: | |
li 228 | |
e 146 | |
mi 102 | |
pona 86 | |
sina 67 | |
jan 61 | |
lon 45 | |
toki 44 | |
sona 39 | |
lili 37 | |
tawa 36 | |
ni 35 | |
ala 33 | |
seme 33 | |
la 32 | |
o 31 | |
jo 27 | |
ona 27 | |
tomo 27 | |
kama 27 | |
moku 25 | |
mute 25 | |
ike 24 | |
telo 22 | |
tenpo 22 | |
meli 21 | |
suli 21 | |
sewi 21 | |
pali 20 | |
soweli 19 | |
ma 19 | |
mije 19 | |
mama 19 | |
wawa 19 | |
ijo 18 | |
pana 17 | |
wile 16 | |
kute 15 | |
ilo 14 | |
wan 14 | |
tan 14 | |
kulupu 13 | |
kili 12 | |
ale 12 | |
pilin 12 | |
a 12 | |
pi 12 | |
anu 11 | |
kepeken 10 | |
sin 10 | |
kala 9 | |
lipu 9 | |
suno 9 | |
awen 8 | |
tu 8 | |
nanpa 8 | |
pu 8 | |
taso 7 | |
nasin 7 | |
nimi 7 | |
kasi 6 | |
mawijo 6 | |
weka 6 | |
pimeja 6 | |
lukin 5 | |
utala 5 | |
ken 4 | |
ante 4 | |
olin 4 | |
sijelo 4 | |
mu 3 | |
sili 3 | |
lawa 3 | |
pini 3 | |
lupa 3 | |
luka 3 | |
sama 3 | |
poka 3 | |
laso 3 | |
tosi 2 | |
en 2 | |
isa 2 | |
noka 2 | |
inli 2 | |
loje 2 | |
insa 2 | |
anpa 2 | |
lape 2 | |
open 2 | |
alasa 2 | |
suwi 1 | |
len 1 | |
waso 1 | |
sonko 1 | |
nasa 1 | |
kon 1 | |
pu character frequency: | |
i 967 | |
a 953 | |
l 706 | |
n 614 | |
o 560 | |
e 545 | |
m 388 | |
s 301 | |
p 272 | |
. 265 | |
k 255 | |
t 252 | |
u 219 | |
w 187 | |
j 153 | |
? 46 | |
, 26 | |
! 25 | |
: 10 | |
" 8 | |
other/different tp stats info - http://jimhenry.conlang.org/conlang/tokipona/tokipona.htm | |
minimal pair info - http://tokipona.net/tp/MinimalPairs.aspx |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment