Skip to content

Instantly share code, notes, and snippets.

@harjitmoe
Last active April 18, 2018 23:07
Show Gist options
  • Save harjitmoe/19f0412ec9bb8a6330bce6683fbbb760 to your computer and use it in GitHub Desktop.
Save harjitmoe/19f0412ec9bb8a6330bce6683fbbb760 to your computer and use it in GitHub Desktop.
IBUS table for Deseret
SCIM_Generic_Table_Phrase_Library_TEXT
VERSION_1_0
BEGIN_DEFINITION
# Library of Congress romanisation of Deseret (followed except for รฏ, which it does not define):
# > https://www.loc.gov/catdir/cpso/romanization/deseret.pdf
#
# Macron accents (overlines) are represented here by a single x prefix, other accents (umlaut and
# circumflex) are represented by a doubled x prefix.
#
# Very roughly, the accents themselves correspond to:
#
# - Macron: lengthening subject to applicable vowel shifts, as follows from a magic E (sid->side).
# - Circumflex: shortening, per the cot-caught distinction.
# - Umlaut: various other variations.
#
# As far as this IME is concerned, the difference between circumflex and umlaut is inconsequential,
# due to both being represented as "xx" before the vowel.
NAME = Deseret
AUTHOR = HarJIT <harjit@harjit.moe>
NAME.en_Dsrt = ๐”๐ฏ๐‘…๐จ๐‘‰๐ฏ๐ป
LANGUAGES = en_Dsrt
SERIAL_NUMBER = 20180417
UUID = 0aa1f540-b1e1-4637-814f-ade9372abe14
LICENSE = LGPL
DESCRIPTION = Deseret script, based on Library of Congress romanisation; use one or two x before instead of accents.
ICON = ibus-table.svg
STATUS_PROMPT = ๐”(๐ผ๐‘‚)
LAYOUT = default
VALID_INPUT_CHARS = ABCDEFGHIJKLMNOPRSTUVWXYZabcdefghijklmnoprstuvwxyz
MAX_KEY_LENGTH = 3
AUTO_COMMIT = TRUE
AUTO_SELECT = TRUE
DEF_FULL_WIDTH_PUNCT = FALSE
DEF_FULL_WIDTH_LETTER = FALSE
USER_CAN_DEFINE_PHRASE = FALSE
DYNAMIC_ADJUST = FALSE
END_DEFINITION
BEGIN_TABLE
# Kluge to make ibus work as expected with the prefixed x:
X X 0
x x 0
xx xx 0
XX XX 0
Xx Xx 0
# ฤ“, name "Long I", magic-E extension of E (= straight extension of i).
xE ๐€ 1
XE ๐€ 1
Xe ๐€ 1
xe ๐จ 1
# ล, name "Long O", magic-E extension of O.
xO ๐„ 1
XO ๐„ 1
Xo ๐„ 1
xo ๐ฌ 1
A ๐ˆ 1
a ๐ฐ 1
# ฤซ, name "Ay", magic-E extension of I (as in, aye-aye).
xI ๐Œ 1
XI ๐Œ 1
Xi ๐Œ 1
xi ๐ด 1
# H, name "H"
H ๐ 1
h ๐ธ 1
# D, name "Dee"
D ๐” 1
d ๐ผ 1
# G, name "Gay" (hard G)
G ๐˜ 1
g ๐‘€ 1
# Dh, name "Thee", as in รฐ (THis and THat)
Dh ๐œ 1
DH ๐œ 1
dh ๐‘„ 1
# Zh, name "Zhee", as in ลพ (reviSIon)
Zh ๐  1
ZH ๐  1
zh ๐‘ˆ 1
# N, name "En"
N ๐ค 1
n ๐‘Œ 1
# ฤ, name "Long E", magic-E extension of A, alternative extension of E.
# Note: there is apparently no letter for a straight lengthening of E. Presumably the
# Mary-Merry-Marry distinction (where it exists) would have to be represented using
# ๐ฃ๐ฏ๐ฏ๐‘‰๐จ-๐ฃ๐ฏ๐‘‰๐จ-๐ฃ๐ฐ๐‘‰๐จ. One can deduce it was incomplete in the Deseret dialect at the time of
# introduction.
xA ๐ 1
XA ๐ 1
Xa ๐ 1
xa ๐ฉ 1
# ลซ, name "Long Oo", magic-E extension of U (also the "oo" in "spoon").
xU ๐… 1
XU ๐… 1
Xu ๐… 1
xu ๐ญ 1
# รด, confusingly named "Short Ah", former side of the cot-caught distinction.
# Note the distinction between ๐‰ (๐ฟ๐ฑ๐ป=cot, ๐บ๐ฑ๐‘„๐‘‰=bother), ๐ƒ (๐ฟ๐ซ๐ป=caught) and ๐‚ (๐‘๐ช๐‘„๐‘‰=father) may be
# incomplete in some accents.
xxO ๐‰ 1
XXO ๐‰ 1
Xxo ๐‰ 1
xxo ๐ฑ 1
# Aw, name "Ow", as in OUt, bOUT, cOW (phonemic aw; English non-phonemic "aw" is o).
Aw ๐ 1
AW ๐ 1
aw ๐ต 1
# P, name "Pee"
P ๐‘ 1
p ๐น 1
# Kluge to make ibus work as expected with the prefixed c:
C C 0
c c 0
# Ch, name "Chee"
Ch ๐• 1
CH ๐• 1
ch ๐ฝ 1
# F, name "Eff"
F ๐™ 1
f ๐‘ 1
# S, name "Ess"
S ๐ 1
s ๐‘… 1
# R, name "Er"
R ๐ก 1
r ๐‘‰ 1
# Ng, name "Eng"
Ng ๐ฅ 1
NG ๐ฅ 1
ng ๐‘ 1
# รค, name "Long A", straight extension of A.
# Note the distinction between ๐‰ (๐ฟ๐ฑ๐ป=cot, ๐บ๐ฑ๐‘„๐‘‰=bother), ๐ƒ (๐ฟ๐ซ๐ป=caught) and ๐‚ (๐‘๐ช๐‘„๐‘‰=father) may be
# incomplete in some accents.
xxA ๐‚ 1
XXA ๐‚ 1
Xxa ๐‚ 1
xxa ๐ช 1
# I, name "Short I", as in pit, sit, bid.
I ๐† 1
i ๐ฎ 1
# U, potentically confusing name "Short O", as in such, much, dud, son, won.
U ๐Š 1
u ๐ฒ 1
# W, name "Wu"
W ๐Ž 1
w ๐ถ 1
# B, name "Bee" (possibly a deliberate allusion to "Deseret"?)
B ๐’ 1
b ๐บ 1
# J, name "Jee"
J ๐– 1
j ๐พ 1
# V, name "Vee"
# Note: Noto appears to use a rather odd glyph variant in which the stroke does not join itself
# on the left, i.e. it appears as a mirrored ฤ. This may be an issue, in that some handwritten
# styles apparently draw the ฤ backward to start with (Noto doesn't).
V ๐š 1
v ๐‘‚ 1
# Z, name "Zee"
Z ๐ž 1
z ๐‘† 1
# L, name "El"
L ๐ข 1
l ๐‘Š 1
# Oy, name "Oi", ligature of รด and i. Earlier versions of Deseret used a simple digraph, later
# versions used a ligature in one of two mutually illegible glyph variants (the Unicode chart
# uses the older of the ligatures).
Oy ๐ฆ 1
OY ๐ฆ 1
oy ๐‘Ž 1
# O, name "Long Ah", as in รฅ (cAUGHt, pAW).
# Note the distinction between ๐‰ (๐ฟ๐ฑ๐ป=cot, ๐บ๐ฑ๐‘„๐‘‰=bother), ๐ƒ (๐ฟ๐ซ๐ป=caught) and ๐‚ (๐‘๐ช๐‘„๐‘‰=father) may be
# incomplete in some accents.
O ๐ƒ 1
o ๐ซ 1
# E, name "Short E", as in bed.
E ๐‡ 1
e ๐ฏ 1
# รผ, name "Short Oo", as in put, or the "oo" in cook and look.
xxU ๐‹ 1
XXU ๐‹ 1
Xxu ๐‹ 1
xxu ๐ณ 1
# Y, name "Yee"
Y ๐ 1
y ๐ท 1
# T, name "Tee"
T ๐“ 1
t ๐ป 1
# K, name "Kay"
K ๐— 1
k ๐ฟ 1
# Th, name "Eth", as in รพ (THink, THankyou).
# (Name not to be confused with the Roman letter eth or edh (รฐ), which is Dh above.)
Th ๐› 1
TH ๐› 1
th ๐‘ƒ 1
# Sh, name "Esh".
Sh ๐Ÿ 1
SH ๐Ÿ 1
sh ๐‘‡ 1
# M, name "Em".
M ๐ฃ 1
m ๐‘‹ 1
# Kluge to make ibus work as expected with the prefixed xy:
xY xY 0
xy xy 0
XY XY 0
Xy Xy 0
# Yลซ, name "Ew", ligature of i with ลซ. Earlier versions of Deseret used a simple iลซ or yลซ digraph,
# later versions used a ligature in one of two mutually illegible glyph variants (the Unicode chart
# uses the older of the ligatures).
xYu ๐ง 1
XYu ๐ง 1
xYU ๐ง 1
XYU ๐ง 1
Xyu ๐ง 1
xyu ๐‘ 1
# รฏ, Shelton Schwa, as in centEr. Had limited adoption. Actually now uses Latin Iota. This is the
# only representation here which does not follow Library of Congress romanisation, due to there not
# being one.
xxI ฦ– 1
XXI ฦ– 1
Xxi ฦ– 1
xxi ษฉ 1
END_TABlE
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment