Querying Datasets with Cognates in the Lexibank Repository
In order to run the code below (code.py), you need to git-clone the lexibank_analysed
package (https://github.com/lexibank/lexibank-analysed) and install it using pip
. Then, you need to download the script and place it into the lexibank-analysed
folder. Before running, you need to run the command cldfbench download cldfbench_lexibank_analysed.py
in order to download all individual datasets. After that, you can just run the script typing python code.py
. The output looks as follows:
Dataset | Concepts | Languages | Words | Cognates | Singletons |
---|---|---|---|---|---|
bdpa | 519 | 538 | 50095 | 750 | 0 |
blustaustronesian | 210 | 20 | 4358 | 321 | 2409 |
bowernpny | 344 | 190 | 44876 | 4494 | 25054 |
cals |