Skip to content

Instantly share code, notes, and snippets.

@LinguList
LinguList / README.md
Created March 26, 2019 11:41
A Primer on Automatic Inference of Sound Correspondence Patterns (3): Extended Experiments with Alignments from the Tableaux Phonétiques des Patois Suisses Romands

A Primer on Automatic Inference of Sound Correspondence Patterns (3): Extended Experiments with Alignments from the Tableaux Phonétiques des Patois Suisses Romands

To run the script provided here, make sure to download the GIST, and install the requirements for LingRex. Then, simply type:

$ python code.py
@LinguList
LinguList / README.md
Created February 27, 2019 15:19
A Primer on Automatic Inference of Sound Correspondence Patterns (2): Initial Experiments with Alignments from the Tableaux Phonétiques des Patois Suisses Romands

A Primer on Automatic Inference of Sound Correspondence Patterns (2): Initial Experiments with Alignments from the Tableaux Phonétiques des Patois Suisses Romands

To run the script provided here, make sure to download the data from Zenodo, and unpack the folder multiple.zip. Then cd into the folder, and run the script as follows:

$ python to_wordlist.py

To install all requirements, just type:

@LinguList
LinguList / README.md
Created February 24, 2019 21:02
Automatic morpheme segmentation (Open problems in computational diversity linguistics 1)

Automatic morpheme segmentation (Open problems in computational diversity linguistics 1)

This little repository contains the analyses I have done to test the Morfessor software on sparse data. It should be mentioned that I just used the defaults for the computation, so it is quite possible, that the results could be further enhanced.

Requirements

To install Morfessor, just type:

$ pip install morfessor
@LinguList
LinguList / README.md
Created December 11, 2018 12:22
Merging datasets with LingPy and the CLDF curation framework
@LinguList
LinguList / README.md
Created November 6, 2018 10:50
Inferring consonant clusters from CLICS data with LingPy: Data and Code

Inferring consonant clusters from CLICS data with LingPy: Data and Code

This GIST accompanies the blogpost explaining the code, which you can finde here.

To install and run the code, run the following in your terminal:

$ pip install -r pip-requirements.txt
$ git clone https://github.com/clld/concepticon-data.git
$ cd concepticon-data
@LinguList
LinguList / README.md
Last active July 16, 2018 15:08
Exporting Sublists form a Wordlist with LingPy and Concepticon

Exporting Sublists form a Wordlist with LingPy and Concepticon

This gist describes, how you can extract sublists from a wordlist in LingPy with help of the pyconcepticon API. See https://calc.hypotheses.org/date/2018/07 for details on the code and additional explanations.

@LinguList
LinguList / README.md
Created June 28, 2016 14:11
Vowel Purity and Rhyme Evidence in Old Chinese Reconstruction

Vowel Purity and Rhyme Evidence in Old Chinese Reconstruction

Data

Data contains the rhyme network (in YAML-format), the different character readings (missing characters indicated by a "?", and the vowel annotations in JSON.

Code

To run the code, make sure you have Python3 installed, as well as a recent version of NetworkX and the community-extension for NetworkX.

@LinguList
LinguList / Sample_Size.md
Last active January 2, 2016 02:18
Supplementary Material for the Paper "Investigating the Impact of Sample Size on Cognate Detection"

Supplementary Material for the Paper "Investigating the Impact of Sample Size on Cognate Detection"

Format

The data-format is basically simple CSV format. Additional markup allows to add key-value descriptions of the dataset and to comment out parts of the data by using the hash character in the beginning of a line. A closer description of the input format (which can be regularly parsed with help of LingPy (http://www.lingpy.org), is given on http://lingpy.org/tutorial/lingpy.basic.wordlist.html.

Information

This dataset is part of the larger "Benchmark Database for Cognate Detection", currently hosted at http://quanthistling.info/bdhl/cognates.php.

@LinguList
LinguList / SCACognateDetection.md
Last active December 29, 2015 04:09
SCA Cognate Detection

SCA Cognate Detection Applied to ASJP Data

Carry out cognate detection analyses on ASJP-data (http://email.eva.mpg.de/~wichmann/ASJPHomePage.htm). By defining a language family or a genus, and specifying the parameters for the respective methods, this Python script carries out an automatic cognate detection analysis and outputs the data in aligned HTML format. For an overview on the three different cognate detection analyses, see the paper by List (2012, PDF version can be downloaded from: http://aclweb.org/anthology-new/W/W12/#0200).

@LinguList
LinguList / ChineseDialectHistory.md
Last active December 28, 2015 09:39
Python code to accompany the paper "Using Phylogenetic Networks to Model Chinese Dialect History".

Source code for the paper "Using Phylogenetic Networks to Model Chinese Dialect History"