@norman
Created May 20, 2010 19:51

Some random examples of Spanish words transcribed to the International Phonetic Alphabet by my in-progress Spanish linguistics library and phonology library.

If you're interested in this stuff, you may want to take a look at how it works: it first transcribes orthography to a base IPA form, then applies phonological rules to the resulting sequence. Finally, syllabification and stress rules are applied using the Sonority Sequencing Principle.
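To give a rough idea of what sonority-based syllabification can look like, here is a minimal Ruby sketch. It is not the library's code: the names `SONORITY_CLASSES`, `MIN_RISE` and `syllabify`, the numeric sonority scale, and the minimum-rise threshold are all assumptions made for illustration. It takes an array of IPA segments, treats every vowel as a syllable nucleus, and pulls preceding segments into the onset only while sonority rises steeply enough. Stress assignment is left out.

```ruby
# Hypothetical sonority scale (an assumption, not the library's own table).
# Vowels sit two steps above glides so that any single segment can open a syllable.
SONORITY_CLASSES = {
  1 => %w[p t k b d g ɡ t͡ʃ],   # stops and affricates
  2 => %w[f s z x β ð ɣ ʝ],     # fricatives
  3 => %w[m n ɲ],               # nasals
  4 => %w[l ʎ ɾ r],             # liquids
  5 => %w[j w],                 # glides
  7 => %w[a e i o u]            # vowels (syllable nuclei)
}.freeze

SONORITY = SONORITY_CLASSES.flat_map { |rank, segs| segs.map { |s| [s, rank] } }.to_h.freeze
VOWEL_RANK = 7
MIN_RISE = 2 # assumed minimum sonority rise between neighbours inside an onset

# Splits an array of IPA segments into syllables (returned as strings).
def syllabify(segments)
  ranks = segments.map { |seg| SONORITY.fetch(seg) } # raises KeyError on unknown segments
  nuclei = ranks.each_index.select { |i| ranks[i] == VOWEL_RANK }
  return [segments.join] if nuclei.empty?

  boundaries = [0]
  nuclei.each_cons(2) do |prev_nucleus, next_nucleus|
    start = next_nucleus
    # Grow the onset leftward while sonority keeps rising by at least MIN_RISE.
    while start - 1 > prev_nucleus && ranks[start] - ranks[start - 1] >= MIN_RISE
      start -= 1
    end
    boundaries << start
  end

  (boundaries + [segments.length]).each_cons(2).map { |from, to| segments[from...to].join }
end

p syllabify(%w[b a β i z m o])         # babismo
p syllabify(%w[n e w m a t i k o])     # neumático
p syllabify(%w[i ɣ w a ɾ j a])         # iguaria
```

A scale like this is only a first approximation: sonority alone would still accept onsets such as `sl` that Spanish does not allow, so a fuller version would probably also need a list of permitted onset clusters.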

I'm currently working on making the phonological rules configurable so that dialectal phenomena like yeísmo, nasal velarization, Rioplatense zheísmo and sheísmo, s-aspiration, seseo, etc. can all be specified, with the goal of producing a reasonable IPA representation of any Spanish word in any Spanish dialect.
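As a rough illustration of what configurable dialect rules might look like, here is a small Ruby sketch. None of it comes from the library: the `RULES` table, the rule names, and `apply_dialect` are hypothetical, and the rules are simplified to plain regex rewrites over an unsyllabified IPA string with no stress marks. A dialect is just the list of rule names to enable, and rules run in the order they are declared.

```ruby
# Hypothetical rule table: name => [pattern, replacement].
# Order matters: yeísmo feeds zheísmo, which in turn feeds sheísmo.
RULES = {
  yeismo:             [/ʎ/, "ʝ"],                    # ʎ merges into ʝ
  zheismo:            [/ʝ/, "ʒ"],                    # Rioplatense ʝ → ʒ
  sheismo:            [/ʒ/, "ʃ"],                    # devoicing ʒ → ʃ (only has an effect after zheismo)
  seseo:              [/θ/, "s"],                    # θ merges into s
  s_aspiration:       [/s(?=[^aeioujw]|\z)/, "h"],   # s → h before a consonant or word-finally
  nasal_velarization: [/n\z/, "ŋ"]                   # word-final n → ŋ
}.freeze

# Applies the enabled rules, in declaration order, to a base IPA string.
def apply_dialect(ipa, enabled)
  RULES.reduce(ipa) do |form, (name, (pattern, replacement))|
    enabled.include?(name) ? form.gsub(pattern, replacement) : form
  end
end

RIOPLATENSE = %i[yeismo zheismo sheismo seseo s_aspiration].freeze

puts apply_dialect("kaʎe", %i[yeismo])   # calle with yeísmo only
puts apply_dialect("kaʎe", RIOPLATENSE)  # calle with the assumed Rioplatense set
puts apply_dialect("kastaɲeðo", %i[s_aspiration])
```

Keeping each rule as data rather than code is what makes this kind of per-dialect switching cheap; real rules would need richer contexts (syllable position, neighbouring segments) than these regexes express.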

This library has been particularly enjoyable to implement in Ruby since it was possible to create easy-to-read DSLs for phonology and orthography rules. My hope is to eventually make something easy enough to write that non-programmer linguists can use it to do phonological experiments. That's probably a ways off though.

   desacotar        de sa ko ˈtaɾ
sinsubstancia    sin subs ˈtan sja
  confitillo        kon fi ˈti ʎo
  desencolar       de sen ko ˈlaɾ
       friso              ˈfɾi so
   bienestar         bje nes ˈtaɾ
  hidroavión        i dɾo a ˈβjon
      atajar            a ta ˈxaɾ
   neumático        new ˈma ti ko
     babismo           ba ˈβiz mo
 refrangible      re fɾan ˈxi ble
    lucífero         lu ˈsi fe ɾo
   castañedo        kas ta ˈɲe ðo
desocasionado  de so ka sjo ˈna ðo
    murmullo           muɾ ˈmu ʎo
      exceso           eks ˈse so
   coriláceo       ko ɾi ˈla se o
    barquero           baɾ ˈke ɾo
         boj                 ˈbox
        hoyo                ˈo ʝo
    escobero         es ko ˈβe ɾo
     iguaria           i ˈɣwaɾ ja
  chinchorro       t͡ʃin ˈt͡ʃo ro
  centesimal       sen te si ˈmal
  progresivo       pɾo ɡɾe ˈsi βo
     muévase           ˈmwe βa se
    suprimir          su pɾi ˈmiɾ
       útero             ˈu te ɾo
     frígido           ˈfɾi xi ðo
  necrología      ne kɾo lo ˈxi a
     franjón            fɾan ˈxon
     bazucar           ba su ˈkaɾ
     prelado           pɾe ˈla ðo
    embancar          em ban ˈkaɾ
  anfetamina      an fe ta ˈmi na
     golfear           ɡol fe ˈaɾ
  impeditivo      im pe ði ˈti βo
 inacentuado     i na sen ˈtwa ðo
    sufragio          su ˈfɾa xjo
       eruto             e ˈɾu to
    poderoso         po ðe ˈɾo so
    estático         es ˈta ti ko
      vahído             ba ˈi ðo
    poplíteo         po ˈpli te o
   prolusión         pɾo lu ˈsjon
  aumentador       aw men ta ˈðoɾ
    traedura         tɾa e ˈðu ɾa
        dele               ˈde le
      lobina            lo ˈβi na
   furúnculo        fu ˈɾun ku lo