Skip to content

Instantly share code, notes, and snippets.

@funderburkjim
funderburkjim / MWlsmissing
Last active August 29, 2015 14:07
Records of Monier-Williams which have a missing literary source.
----------------------------------------------------------------------
AdhyR. = आध्यात्म-रामायणम्
Occurs in 3 records of MW
74633 cumbaka m. -maRi Prab. vi , 16 **AdhyR.** i , 1 , 18
78697 jAtiBrazwa mfn. fallen from caste , **AdhyR.** i ,1 , 56 .
89684 daRqavat ind. ( with pra-Ramya , prostrating the body ) in a
straightline , **AdhyR.** Introd. 5.
------------------------------------------------------------------------
001 akaTita
001 akanizTa -> akanizWa :()
001 akampita
------------------------------------------------------------------------
002 agnida
002 agbudagDa -> agnidagDa :(agradagDa,anudagDa)
002 agnidUta
------------------------------------------------------------------------
003 agniSiKa

001 ajyeyatA 001 aRIcinmOna -> aRIcin (acintana,aRuBinna,aticintana,anucintana,aricintana,avicintana,avIcimant,avIcimaya) 001 aRu 001 headword aRIcinmOna --- page 1-014


002 aBipitva 002 aBipratArinkAkzaseni -> () 002 aBipraSnin

@funderburkjim
funderburkjim / vei-fuzzyalpha.py
Created April 17, 2015 02:26
fuzzyalpha program
""" fuzzyalpha.py
Apr 14, 2015 for VEI - applied to faultfinder
Attempt to get spelling change suggestions for Sanskrit.
Usage: python26 fuzzyalpha.py vei-only-notrxx-page.txt fuzzyalpha.txt ../../../../../awork/sanhw1/sanhw1.txt ../../veihw2.txt
2nd usage
python26 fuzzyalpha.py vei-nonverbs1.txt fuzzyalpha1.txt ../../../../../awork/sanhw1/sanhw1.txt ../../veihw2.txt
input.txt is a list of headwords, one per line, in slp1 transliteration
Note: This is specialized to vei-only-notrxx-ff.txt
@funderburkjim
funderburkjim / levenshtein.py
Created April 17, 2015 02:28
Edit distance module used by fuzzyalpha.py
"""Levenshtein distance between 2 strings.
Source: http://en.wikibooks.org/wiki/Algorithm_Implementation/Strings/Levenshtein_distance#Python, first version
Jan 5, 2015. This file copied from TM2013/vcpte/ejf/vac-vcp-cmp1/ folder.
"""
def levenshtein1(s1, s2,m):
# returns levenshtein distance, but returns m if
# the distance would be greater than or equal to m.
# this is done for efficiency in tests such as
# if levenshtein1(s1, s2,m) < m:
# returns -1 if distance is > m
@funderburkjim
funderburkjim / vei-only-notrxx-page.txt
Created April 17, 2015 02:30
input file used by fuzzyalpha.py
aRIcinmOna:1-014
aBipratArinkAkzaseni:1-027
aByagniEtaSAyana:1-029
amAvAsyaSARdilyAyana:1-031
aruRaOpaveSigOtama:1-035
ahInAASvatTya:1-051
AruRaOpaveSi:1-062
aSvatTva:1-069
iwantkAvya:1-076
uccEHSravaskOpayeya:1-084
@funderburkjim
funderburkjim / config.yaml
Created May 2, 2015 00:47
shell session for an install of vagrant machine using puphpet
vagrantfile:
target: local
vm:
box: puphpet/ubuntu1404-x64
box_url: puphpet/ubuntu1404-x64
hostname: local.puphpet
memory: '512'
cpus: '1'
chosen_provider: virtualbox
network:
@funderburkjim
funderburkjim / pd_deva_ne_iast.txt
Created May 11, 2015 01:37
Headwords from Cologne digitization were Devanagari spelling inconsistent with IAST spelling
1-0002b:aicadeva:208,209:aicadeva:Ecadeva
1-0002b:aiculA:210,211:aicula1:EculA
1-0002b:aibuka:212,213:aibuka:Ebuka
1-0002b:aivuli:214,215:aivuli:Evuli
1-0027b:aMsapArSvAmitApa:3435,3437:am3sapa1rs4va1bhita1pa:aMsapArSvABitApa
1-0028b:aMsaBitti:3509,3514:am3sa-bhitti1:aMsaBittI
1-0032a:aMhi:3979,4011:am3h-ri:aMhri
1-0032b:aMhrayagra:4063,4064:am3hryagra:aMhryagra
1-0033b:aMsamadaBaYjana:4177,4179:a-kam3samadabhan5jana:akaMsamadaBaYjana
1-0036a:akaTayitavya:4522,4523:a-kathayitvya:akaTayitvya
@funderburkjim
funderburkjim / headwordforms.md
Created May 6, 2017 21:13
Headword forms in Cologne dictionaries

acc.txt

<HI>{#akzamAlApratizWA#}¦

<HI>{#KEY2#}¦
KEY2 slp1

ap90.txt

<P>.{#a#}
@funderburkjim
funderburkjim / headwordforms.md
Created May 6, 2017 21:13
Headword forms in Cologne dictionaries

acc.txt

<HI>{#akzamAlApratizWA#}¦

<HI>{#KEY2#}¦
KEY2 slp1

ap90.txt

<P>.{#a#}