LinguList / Networks_of_Lexical_Borrowing.md
Last active December 28, 2015 08:59
MLN reconstruction for Indo-European languages.

Source Code and Data for the Paper: "Networks of lexical borrowing and lateral gene transfer in language and genome evolution"

Usage

Usage is straightforward: after downloading all scripts (just clone this gist), cd into the folder and type:

LinguList / README.md
Last active August 29, 2015 14:02
PhylogeneticNetworkApproaches

Test Sets for Phylogenetic Network Approaches in Historical Linguistics

This gist offers test sets for phylogenetic network approaches. All data are provided in several formats:

  • a tree representation of the underlying taxa in Newick format (nwk-file)
  • a CSV representation of the presence-absence patterns of the data (csv-file)
  • a NEXUS representation of the presence-absence matrix of the data (nex-file)
  • a wordlist representation of the data, which is needed for additional linguistic analyses (qlc-format)
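
To illustrate how the csv-file and the nex-file relate to each other, here is a minimal Python sketch that reads a presence-absence table and writes it out as a NEXUS data block. It is not taken from the gist: the toy data, the taxon-by-character layout of the CSV, and the helper names `read_presence_absence` and `to_nexus` are all assumptions made for illustration, and the actual files may be organised differently.

```python
import csv
import io

# Hypothetical toy data: one row per taxon, one column per cognate set.
# The layout of the gist's actual csv-file may differ.
SAMPLE_CSV = """taxon,c1,c2,c3
German,1,0,1
English,1,1,0
French,0,1,1
"""


def read_presence_absence(handle):
    """Return {taxon: '101...'} from a taxon-by-character CSV."""
    reader = csv.reader(handle)
    next(reader)  # skip the header row
    return {row[0]: "".join(row[1:]) for row in reader if row}


def to_nexus(matrix):
    """Serialise a presence-absence matrix as a minimal NEXUS data block."""
    nchar = len(next(iter(matrix.values())))
    width = max(len(taxon) for taxon in matrix)
    lines = [
        "#NEXUS",
        "BEGIN DATA;",
        f"DIMENSIONS NTAX={len(matrix)} NCHAR={nchar};",
        'FORMAT DATATYPE=STANDARD SYMBOLS="01" MISSING=? GAP=-;',
        "MATRIX",
    ]
    # One line per taxon: padded name, then its string of 0/1 states.
    lines += [f"{taxon.ljust(width)}  {states}" for taxon, states in matrix.items()]
    lines += [";", "END;"]
    return "\n".join(lines)


matrix = read_presence_absence(io.StringIO(SAMPLE_CSV))
nexus = to_nexus(matrix)
print(nexus)
```

To try this on a real file, replace `io.StringIO(SAMPLE_CSV)` with an opened file handle, after checking that the file follows the assumed layout.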

At the moment, only one test set is offered in these formats. This test set was the basis of our network analysis of 40 Indo-European languages (see https://gist.github.com/LinguList/7475830). In this dataset, known borrowings have been deliberately reintroduced into the data, in order to see