Skip to content

Instantly share code, notes, and snippets.

View boblannon's full-sized avatar

Bob Lannon boblannon

View GitHub Profile
@planemad
planemad / india-place-language.md
Last active October 19, 2022 18:04
The language of Indian places

India, as is already well known, is a complex universe. The 1.2 billion people who inhabit an area 1/3rd the area of the US speak over 1000 native tongues. There are 2 official languages in use by the federal government for administration - English and Hindi. The Indian Union gives special status to 22 scheduled languages which are considered the major regional languages, each with many millions of native speakers. These 22 languages are what can be practically considered the major languages of India that will cover over 95% of the population.

To take complexity to the next level, there are 9 writing systems that are exclusively found only in India and are in active use for their respective languages. While most scheduled languages have their dedicated writing system, other languages have adopted a writing system like the case of [Konkani](https://en.wikipedia.o

@JoFrhwld
JoFrhwld / syllabify.R
Last active December 15, 2015 23:29
This script does a very simple onset maximization syllabification.
syllabify <- function(trans){
require(stringr)
segments <- unlist(str_split(trans, " "))
nuclei_string <- "A|E|I|O|U|@"
nucs <- grep(nuclei_string, segments)
n <- length(nucs)
r_colored <- nucs[grep("R", segments[nucs])]