@DogLooksGood
Created February 25, 2021 02:48
utf8demo
UTF-8 encoded sample plain-text file
‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾
Original version from Markus Kuhn [ˈmaʳkʊs kuːn] from University of Cambridge
http://www.cl.cam.ac.uk/~mgk25/
The original "xterm-UTF-8-demo.txt"
https://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-demo.txt
https://gist.github.com/msabramo/3921955
UTF-8 test file (closely related)...
https://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt
This version has been updated with more sets, and some other symbols as
a reference.
This page started from the aforementioned example page but has expanded
far beyond the original. It is particularly focused on the interactions
between Unicode characters, so they work as a seamless whole. It is meant
to provide a reference of Unicode characters that work well together, and to
report what doesn't work, even though it should.
Before using Unicode in applications I recommend you also look at
"unicode.txt" in the same directory. Many examples below are designed to work
with a fixed-width or non-proportional font.
Summary...
X Windows (XTerm, Vim)...
The font I most commonly use is...
normal: "-misc-fixed-medium-r-*-*-15-*-75-75-c-90-iso10646-1"
bold: "-misc-fixed-bold-r-*-*-15-*-75-75-c-90-iso10646-1"
These fonts are not the same as the more classic "9x15" and "9x15bold" fonts.
While the above normal font implements most of the glyphs, the bold font
implements almost none of the UTF-8 glyphs (just the default box). This makes
it difficult to mix bold text (to which XTerm also adds high-intensity colors)
with normal UTF-8 glyphs (which get no high-intensity color). Of course you can
set the color yourself, but that depends on the background, where bolding does not!
But for the display of plain text the above works great.
The ISO10646 'fixed' font (smaller) and the above fonts implement almost all
basic Unicode glyphs, with a couple of minor mistakes. The newer glyphs have not
been added, as the font is not in active development. One specific glyph that is
sorely missed is "vertical line extension". All other X window fonts I have
tried are very incomplete, with most of the Unicode glyphs missing, even though
a font may be ISO10646 (Unicode) compliant.
The Combining characters (Diacritical and Symbol) do however just work when
output to "XTerms", or used in "vim", with the exception of a few of the more
unusual space characters (see below), which you should rarely come across.
ASIDE: This is still my preferred terminal, editor, and font, even though the
newer Unicode glyphs (or emoji) are not defined.
GTK Fonts (Chrome, Firefox, Pango, GEdit, GVim, Kate)...
This is now favored by more modern applications (which often don't have the
range of configuration options of the older xterm applications). They
do define practically ALL the Unicode glyphs, including the newer ones, but
without showing the same care about interactions between characters that the
older X window fonts did, especially with 'graphical' glyphs.
The "Monospace 10" font works best of all, which is not surprising as it is the
default fixed-width font in GTK. However all the modern applications
listed above (except for "Kate") leave annoying gaps between lines. I also
find the normal dash ('-') character to be annoyingly short.
The "Terminus" font does not leave gaps when used with "gvim", but fails to be
totally "fixed width" (see the "gvim" note below). It contains an annoying
number of proportional characters in what should be a fixed-width font. Mileage
varies.
Notes and Problems...
If the first non-space character in a line is special (braille, some math
characters, diacritic), then all the spaces before that character can be
replaced with matching half-width spaces rather than the normal fixed-width
space. Aargh...
"Combining Diacritical Marks" generally work in all GTK applications,
but those applications often fail to handle "Combining Symbols" (block U+20D0) properly.
EG: These very useful combined characters X⃞ ╳⃞ ✓⃞ ✗⃞ do not always work...
Works in: "Firefox", "GVim"
Fails in: "Gedit", "Kate", "Pango", "Chrome"
"Gvim" works hard to ensure that even proportional fonts are handled as if they
are fixed-width. However this still results in some quirks, like all characters
being double-width for fonts with fixed-width glyphs.
The KDE "Kate" editor works better in this regard.
Anthony Thyssen, August 2019
===============================================================================
Typographical Usage of Unicode:
╔══════════════════════════════════════════╗
║ ║
║ • ‘single’ and “double” quotes ❛ ❜ ❝ ❞ ║
║ ║
║ • Curly apostrophes: “We’ve been here” ║
║ ║
║ • Latin-1 apostrophe and accents: '´` ║
║ ║
║ • ‚deutsche‘ „Anführungszeichen“ ║
║ ║
║ • †, ‡, %, ‰, •, 3–4, —, −5/+5, ™, … ║
║ ║
║ • ASCII safety test: 1lI|, 0ODQ, 8B ║
║ ╭────────────────╮ ║
║ • Currency in Box: │ 14.95€ 5£ 2¢ │ ║
║ ╰────────────────╯ ║
╚══════════════════════════════════════════╝
Mathematics and Science Usage:
∮ E⋅da = Q, n → ∞, ∑ f(i) = ∏ g(i),
∀x∈ℝ: ⌈x⌉ = −⌊−x⌋, α ∧ ¬β = ¬(¬α ∨ β),
⟦ ℕ ⊆ ℕ₀ ⊂ ℤ ⊂ ℚ ⊂ ℝ ⊂ ℂ ⟧,
a ≠ b ≡ c ≤ d ≥ e (⟪a⟫ ⇔ ⟪b⟫)
2H₂ + O₂ ⇌ 2H₂O, R = 4.7 kΩ,
∆d ≈ √2, ∛5, ⌀200 mm, ∠±30°, ⊾⊿
⎧ ⎡ ⎛   ⎞ ⎤ ⎫   ⌠   ┌─────┐   ∞
⎪ ⎢ ⎜   ⎟ ⎥ ⎪   ⎮   │a²+b³    ⎲
⎨ ⎢ ⎜   ⎟ ⎥ ⎬   ⎮   │─────    ⎳aⁱ-bⁱ
⎪ ⎢ ⎜   ⎟ ⎥ ⎪   ⎮   ⎷ c₈      i=1
⎩ ⎣ ⎝   ⎠ ⎦ ⎭   ⌡
In the above examples bracket alignment requires a fixed-width font.
X Window 'fixed' fonts work correctly for all the above.
All other (non-fixed width) X fonts are missing a lot of glyphs.
GTK "Monospace" works well, except for the extended square root base ('⎷')
character. This displays more like a normal square-root ('√').
Also the double-height summation (like '∑') does not join up correctly vertically.
-------------------------------------------------------------------------------
Typographical:
dash: - ‘single’ apostrophe '
ndash: – “double” ascent ´
mdash: — ellipsis … descent `
❛ fancy ❜ ❝ quotes ❞
Spaces:
I use this as a reference of 'invisible characters'.
No Break Space -> <- \u00A0 (also called meta-space)
Narrow No break -> <- \u202F
Space Unicode: \u2000 to \u200B '           ​'
' ' \u205F Math Space
'⠀' \u2800 Braille Space
' ' \u3000 Chinese Ideograph Space (double width)
'⁠' \u2060 Word Joiner (vim has trouble, and stuffs rest of line)
'' \uFEFF Zero Width - no break space
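When one of these invisible characters sneaks into a file it can be hard to
spot. The following is a minimal sketch, not part of the original reference,
that reports them by name using Python's standard "unicodedata" module. The
helper names "describe" and "find_invisibles" are made up for illustration,
and the choice of categories (Zs, Cf, plus the braille blank) is an assumption
that covers the characters listed above.

```python
import unicodedata

def describe(ch):
    """Return the code point, official name and category of one character."""
    name = unicodedata.name(ch, "<unnamed>")
    return f"U+{ord(ch):04X} {name} [{unicodedata.category(ch)}]"

def find_invisibles(text):
    """List (index, description) for non-ASCII spaces and format characters."""
    hits = []
    for index, ch in enumerate(text):
        if ch == " ":
            continue  # the plain ASCII space is expected, skip it
        # Zs = space separators, Cf = format characters (word joiner, BOM...)
        # U+2800 (braille blank) is a symbol, so it is checked explicitly.
        if unicodedata.category(ch) in ("Zs", "Cf") or ch == "\u2800":
            hits.append((index, describe(ch)))
    return hits

for index, info in find_invisibles("a\u00A0b\u2060c\u3000d"):
    print(index, info)
```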
Various Symbols Grouped by Type:
Ticks & Crosses: ✓ ✔ ✕ ✖ ✗ ✘ √
╳ ☓ Ⅹ × x (last is a normal 'x')
☐ ☑ ☒ 〿 X⃞ ╳⃞ ✓⃞ ✗⃞ (last group are combined symbols)
Centered: ․ ⋅ ‧ ٠ ᛫ ・ • ∙ ●
⋄ ᛜ ∘ ◦ ◌ ○ ๐ ໐ o ◯  0 O
⊙ ⊚ ◉ ◎ ๏ ⌾ ☉ ʘ ⨀ (goes weird in GTK)
․ . ⡀ ⢀ ⠠ ⠄ (period and braille dots)
。 ⋄ ° ゚ ﹾ
ᛜ ◇ ⟐ ◈ ◆ ◊
▵ ▴ Δ △ ◬ ▲
▫ ▪ ◻ ◼ ▢ □ ⊡ ▣ ■
.⃝ ⋅⃝ ∘⃝ •⃝ •⃟ •⃞ ▫⃞ (these are combined symbols)
Stars: ⋆ ★ ☆ ✩ ✫ ✬ ✭ ✮ ✯ ✰
˖ + ⊹ ᛭ ✛ ✜ ✚
☩ ⌖ ✢ ✣ ✤ ✥ ⟡ ✧ ✦
* ⁎ ᛡ ✶ ✲ ✱ ✻ ✼ ✽ ✾
Arrows: ← ↖ ↑ ↗ → ↘ ↓ ↙
⏎ ☇ ↪ ➥ ➞ ➡ ➟ ➠ ⟹ ⟶ ⟵ ⟸ ⟷
Arrows Multi: ──➤ ══━━➤ ◅―――▻ ⊲―――⊳ ≺―――≻ ◁―――▷ ◀―――▶
Punctuation: ? ¿ ␦ ! ‼ ¡ ❢
Brackets: ( ⸨ ⸩ ) ⁽ ₍ ⁾ ₎ [ ⟦ ⟧ ]
≺ ‹ < ⟨ ⟩ > › ≻
⟪ « ≪ ≫ » ⟫
Vertical Bars: | ¦ ❘ ❙ ❚ │ ╎ ┆ ┊ ╷❘ﺍ╵
Maths: × ÷ ±
Quote Pairs: '' "" `´ ‚‛ “” „‟ ❛❜ ❝ ❞
Ellipses: ․ ‥ … ⋯ ┈ ┉ ┄ ┅
Footnotes: ¶ ¥ § ¬ † ‡ ⁋ ⁑ ⌘ ϯ Ϯ ϟ ☥
Other symbols: ❤ ♡ ♂ ♀ ☠ ✄ ☢ ∞ ⌑ ␥ ⑅
€ £ ¢ ℃ ℉ © ® ™ ⋮
␛ ␠ ␡ ␄ ‸ ﬩ ␣ ¬ ⁺
❖ ⸭ ⸪ ⸫ ⸬ ። ፡ ፧ ⡇ ⠶ ⣤ ⣿ (goes weird in GTK)
⌚ ⌛ (double width, in x window!)
Smiley, faces: ☺ ☹ ☻ ⍨
㋡ ㋛ 〠 シ ッ ツ ヅ (double width, not in x windows)
Morphology
⊙ ⊚ ⊗ ⊕ ⊖ ⊘ ⊛ ⊜ ⊝
Part circle
● ◐ ◑ ◒ ◓ ◔ ◕ ◖ ◗
Super-scripts (are not in sequence)...
⁰ ¹ ² ³ ⁴ ⁵ ⁶ ⁷ ⁸ ⁹ ⁺ ⁻ ⁼ ⁽ ⁾ ⁱ ⁿ
Sub-scripts...
₀ ₁ ₂ ₃ ₄ ₅ ₆ ₇ ₈ ₉ ₊ ₋ ₌ ₍ ₎
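The "not in sequence" remark matters when converting digits in a program:
superscript 1, 2 and 3 sit in the Latin-1 range, while the rest are in the
U+2070 block, so simple code point arithmetic only works for sub-scripts.
A small sketch of a lookup-table approach (the function names "superscript"
and "subscript" are illustrative, not from any library):

```python
# Superscript digits: 1, 2 and 3 live in Latin-1, the others in U+2070..U+2079.
SUPER_DIGITS = "\u2070\u00b9\u00b2\u00b3\u2074\u2075\u2076\u2077\u2078\u2079"
# Subscript digits are contiguous: U+2080..U+2089.
SUB_DIGITS = "".join(chr(0x2080 + n) for n in range(10))

TO_SUPER = str.maketrans("0123456789", SUPER_DIGITS)
TO_SUB = str.maketrans("0123456789", SUB_DIGITS)

def superscript(digits):
    """Replace ASCII digits with their superscript forms."""
    return digits.translate(TO_SUPER)

def subscript(digits):
    """Replace ASCII digits with their subscript forms."""
    return digits.translate(TO_SUB)

print("a" + superscript("2") + " + b" + superscript("3"))
print("H" + subscript("2") + "O")
```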
Fractions
¼ ½ ¾ ⅓ ⅔ ⅕ ⅖ ⅗ ⅘ ⅙ ⅚ ⅛ ⅜ ⅝ ⅞ % ℅ ‰ ‱
Roman numerals
Ⅰ Ⅱ Ⅲ Ⅳ Ⅴ Ⅵ Ⅶ Ⅷ Ⅸ Ⅹ Ⅺ Ⅻ Ⅼ Ⅽ Ⅾ Ⅿ
ⅰ ⅱ ⅲ ⅳ ⅴ ⅵ ⅶ ⅷ ⅸ ⅹ ⅺ ⅻ ⅼ ⅽ ⅾ ⅿ
Circle Numbers
➀ ➁ ➂ ➃ ➄ ➅ ➆ ➇ ➈ ➉
➊ ➋ ➌ ➍ ➎ ➏ ➐ ➑ ➒ ➓
⓪ ① ② ③ ④ ⑤ ⑥ ⑦ ⑧ ⑨
⑩ ⑪ ⑫ ⑬ ⑭ ⑮ ⑯ ⑰ ⑱ ⑲ ⑳
Bracket Numbers
⑴ ⑵ ⑶ ⑷ ⑸ ⑹ ⑺ ⑻ ⑼ ⑽ ⑾ ⑿ ⒀ ⒁ ⒂ ⒃ ⒄ ⒅ ⒆ ⒇
Number Period
⒈ ⒉ ⒊ ⒋ ⒌ ⒍ ⒎ ⒏ ⒐ ⒑ ⒒ ⒓ ⒔ ⒕ ⒖ ⒗ ⒘ ⒙ ⒚ ⒛
These are all pretty good, though GTK fonts do replace some characters
with emoji equivalents.
Also see Spinners for some unicode animation effects...
https://antofthy.gitlab.io/info/ascii/Spinners.txt
-------------------------------------------------------------------------------
Drawing Characters...
Box Drawing, Block Elements & Geometric Shapes U+2500:
Box Character Examples...
┌┬┐ ╓╥╖ ┌┬┐ ┎┰┒ ╭╷╮ ╃╀╄ ╆╈╅ │╎┆┊ ┃╏┇┋ █ ▄████▄
├┼┤─╟╫╢ ├┼┤─┠╂┨ ╶┼╴ ┽┼┾ ╊╋╉ │╎┆┊ ┃╏┇┋ ▉ ╱╲╱╲╳╳╳ ▐▌ ▐▌
└┴┘ ╙╨╜ └┴┘ ┖┸┚ ╰╵╯ ╅╁╆ ╄╇╃ │╎┆┊ ┃╏┇┋ ▊ ╲╱╲╱╳╳╳ ▄▀▀█▀ ▐▌
│ ║ │ ┃ ▋ ╱╲╱╲╳╳╳ ▄ ▐▄ ▐▌▀▀▄
╒╤╕ ╔╦╗ ┍┯┑ ┏┳┓ ┏╻┓ ┌╽┐ ┟┲┱┧ ──── ━━━━ ▌ ╲╱╲╱╳╳╳ ▐▀ ▄▄ ▀▌ ▄▀▀ ▀▄ ▀
╞╪╡═╠╬╣ ┝┿┥━┣╋┫ ╺╋╸ ╼╋╾ ┞┺┹┦ ╌╌╌╌ ╍╍╍╍ ▍ ▐ ▀██▀ ▌▐ ▄██▄ ▌
╘╧╛ ╚╩╝ ┕┷┙ ┗┻┛ ┗╹┛ └╿┘ ┢┭┮┪ ┄┄┄┄ ┅┅┅┅ ▎ ▁▂▃▄▅▆▇█ ▀▄ ▄▄▀ ▐ ▀▀ ▌
┡┵┶┩ ┉┉┉┉ ┉┉┉┉ ▏ █ ▀▄▄ ▄▀
╔══╦══╗ ┌──┬──┐ ╭──┬──╮ ╭──┬──╮ █ █ █ ▐
║┌─╨─┐║ │╔═╧═╗│ │╒═╪═╕│ │╓─╁─╖│ ▛▀▜ ▗▄▖ ╲│╱ █ █ ▐▌ █
║│ │║ │║ ║│ ││ │ ││ │║ ┃ ║│ ▌█▐ ▐ ▌ ▞▚ ─╳─ █ █ ▐▌ █
╠╡ ╞╣ ├╢ ╟┤ ├┼─┼─┼┤ ├╫─╂─╫┤ ▙▄▟ ▝▀▘ ▚▞ ╱│╲ ▐▌▐▌ █ █
║│ │║ │║ ║│ ││ │ ││ │║ ┃ ║│ ▐▌ █▄ ▐▌ █
║└─╥─┘║ │╚═╤═╝│ │╘═╪═╛│ │╙─╀─╜│ ░░▒▒▓▓██ █ ▀▀▀ ▐
╚══╩══╝ └──┴──┘ ╰──┴──╯ ╰──┴──╯ ░░▒▒▓▓██ ▐▌ █
█▄ ▄█
Related Characters: ˖ + ﺍ
Other Examples...
https://en.wikipedia.org/wiki/Box-drawing_character
https://www.vidarholen.net/cgi-bin/labyrinth?w=13&h=13
http://xahlee.info/comp/unicode_drawing_shapes.html
┌─┬┐ ╔═╦╗ ╓─╥╖ ╒═╤╕ ┌─┬─────────┬─────────┐ ┌┮┭┬┬┲┱┬┬┬┐
│ ││ ║ ║║ ║ ║║ │ ││ │ └─┐ ┌───┐ ├───┬───┐ │ ┟┼┼┼┼╄╃┼┼┼┧
├─┼┤ ╠═╬╣ ╟─╫╢ ╞═╪╡ ├─┐ │ └─┐ │ │ ╷ ╵ ╷ ╵ │ ┞┼┼┼┼┼┼┼┼┼┦
└─┴┘ ╚═╩╝ ╙─╨╜ ╘═╧╛ │ │ │ ╷ │ ╵ │ └───┴─┐ │ ┢╅┼┼┼┼┼┼┼╆┪ \_wrong in
┌───────────────────┐ │ │ └─┤ │ ╶─┴─────╴ │ │ ┡╃┼┼┼┼┼┼┼╄┩ / xterms
│ ╔═══╗ Some Text │▒ │ └─╴ │ │ ┌─────┬───┤ │ ├┼╆╈╅┼┼╁┼┼┤
│ ╚═╦═╝ in the box │▒ │ ╶─┬─┤ └─┘ ╶─┐ │ ┌─┘ │ ├┼╊╋╉┼┾╋┽┼┤
╞═╤══╩══╤═══════════╡▒ ├─┐ │ ╵ ┌─────┤ │ ╵ ╶─┤ ├┼╄╇╃┼┼╀┼┼┤
│ ├──┬──┤ │▒ │ │ └─┐ │ ┌─╴ │ └─┬─┐ │ ├┼┼┼┼╆╅┼┼┼┤
│ └──┴──┘ │▒ │ └─┐ └─┘ │ ╶─┴─┐ │ ╵ │ └┶┵┴┴┺┹┴┴┴┘
└───────────────────┘▒ │ ╶─┴─────┴───╴ ╵ │ ╶─┤
▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒▒ └─────────────────┴───┘
These are almost right in the X window "fixed" font, except for a couple of
mistakes, and all work perfectly in the GTK "Terminus" font.
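Frames like the "Some Text in the box" example can be generated rather than
typed by hand. This is a minimal sketch using only the light box-drawing
characters; the function name "boxed" and its "pad" parameter are assumptions
for illustration. It pads by code point count, so it only lines up for
single-width characters in a fixed-width font.

```python
def boxed(text, pad=1):
    """Frame text with light box-drawing characters from the U+2500 block."""
    lines = text.splitlines() or [""]
    longest = max(len(line) for line in lines)
    inner = longest + 2 * pad  # columns between the two vertical bars
    out = ["┌" + "─" * inner + "┐"]
    for line in lines:
        out.append("│" + " " * pad + line.ljust(longest) + " " * pad + "│")
    out.append("└" + "─" * inner + "┘")
    return "\n".join(out)

print(boxed("Some Text\nin the box"))
```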
Other web sites...
http://tamivox.org/dave/boxchar/index.html
http://clubmate.fi/using-pseudographics-in-blogposts-drawing-ascii-diagrams-and-boxes/
Warning: box lines do not always work with other shapes. But should!
Obviously font designers do not care about box drawing fonts all that much!
As such they really only work in fixed width terminals, like xterms ☹
╷ ∧ ⊤
◁──╯╭──▷ ╱ ╲ ╱ ⊢┼──◠─◡──⌃─⌄──⊣
╵ ∨ ⊥
⇐═══⇒ ⊨═══ ⊫═══⋕═══≑═══≡═══≣═══ ⊩── ⊪──
⟸═══⟹ Longer proportional font double line arrows
─────➤ ═════➤ Right arrow heads (x windows and GTK "Monospace")
These do not connect to each other properly, but good for framing
◿ ◺ ◢ ◣ ◜◝   ⌌ ⌍ ⌜ ⌝ ⎾ ⏋ ⌃ ◠ ⌢ ⎴
◹ ◸ ◥ ◤ ◟◞   ⌎ ⌏ ⌞ ⌟ ⎿ ⏌ ⌄ ◡ ⌣ ⎵
^- this is wrong in X "misc 9x15 Unicode"
Horizontal Lines U+2300:
vv----- horizontal line extension
▔▔ ⎺⎺⎻⎻⎯⎯⎼⎼⎽⎽ ▁▁ ── box drawing horizontal line ―― horizontal bar
^^------------^^--- 1/8 block, top and bottom ━━ bold line
▷───◁ \u2500 box drawing horizontal line
▷⎯⎯⎯◁ \u23af horizontal line extension
▷―――◁ \u2015 horizontal bar (used in arrows below)
◅―――▻ ◄―――► ⊲―――⊳ ≺―――≻
◃―――▹ ◂―――▸ ⊢―――⊣ ⊰―――⊱ ━━――━━══――
◁―――▷ ◀―――▶ ⟵―――⟶ «―――» ⟝―――⟞
Notes:
\u2500 box drawing line, should work but often doesn't.
\u23af horizontal line extension works well, though "chrome" replaces it.
\u2015 horizontal bar, works but is very long in proportional fonts.
All work perfectly for X window "fixed" fonts.
Only "horizontal bar" works for GTK "Monospace"
Vertical Lines U+2300:
There are a lot of vertical bars for extended brackets in the U+2300 Unicode
block. You should ensure you use the right one for each bracket type. See the
example in the "Mathematics and Science Usage" section above.
▏ \u258f 1/8 block left
⎸ \u23b8 left box line (does not work well)
⎢ \u23a2 ⎡ ⎢ ⎣ left square bracket
⎜ \u239c ⎛ ⎜ ⎝ left parenthesis
⎪ \u23aa ⎧ ⎨ ⎩ ⎪ ⎫ ⎬ ⎭ curly braces (extension bad in GTK)
│ \u2502 box drawing vertical line (see above)
⎮ \u23ae ⌠ ⎮ ⌡ integral sign
⎟ \u239f ⎞ ⎟ ⎠ right parenthesis
⎥ \u23a5 ⎤ ⎥ ⎦ right square bracket
⎹ \u23b9 right box line (does not work well)
▕ \u2595 1/8 block right
⏐ \u23d0 vertical line extension (missing in X window fonts)
△ ▵ ▲ ▴ ∆ ⋏ ∧ ⊤ ⟙
│ │ │ │ │ │ │ │ │ <-- using \u2502 box drawing vert line
▽ ▿ ▼ ▾ ∇ ⋎ ∨ ⊥ ⟘ or use '┃' \u2503 for a bolder line
Just about all the centered vertical lines work for vertical arrows.
No problems for X windows "fixed" font (as always).
GTK "Monospace" works okay, but "Terminus" has proportional width faults
unless you are using "gvim" which 'squares' all characters anyway.
Upside Down Letters:
NB: Characters are from all over Unicode; some are not always available
Z⅄XMᴧ∩⊥SᴚΌԀOᴎW⅂⋊ſIH⅁ℲƎ◖Ↄ𐐒∀
zʎxʍʌnʇsɹbdouɯƖʞɾᴉɥƃɟǝpɔqɐ
068ㄥ9ϛㄣƐᄅƖ
˙ ʻ ؛ ¿ ¡ „ ⅋
Alternatives
i -> ı ᴉ
l -> Ɩ ʅ
2 -> Z ᄅ
G -> ⅁ פ
. -> ˙⠐ ° ˚
, -> ' ʻ ‘
Example Phrases
¡ɐᴉlɐɹʇsn∀ ʻɹǝpun uʍop ɯoɹɟ sƃuᴉʇǝǝɹ⅁
¡ɹǝpun uʍop ɯoɹɟ ʻʎɐp,ɐ⅁
¡sǝᴉqɯoz noʎ ƖƖɐ ʎǝɥ ʎǝH
˙˙˙unɟ ǝʌɐH ˙sǝıqɯoz noʎ ƖƖɐ ʇɥƃıu poo⅁
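The online converters listed in this section presumably work along these lines: map
each letter to its flipped look-alike, then reverse the string so it reads
correctly when rotated. A rough sketch using only the lowercase row from the
table above (the names "FLIPPED_ROW" and "upside_down" are made up here, and
capitals, digits and punctuation are deliberately left unhandled):

```python
# Lowercase row from the table above: flipped forms of z, y, x ... b, a.
FLIPPED_ROW = "zʎxʍʌnʇsɹbdouɯƖʞɾᴉɥƃɟǝpɔqɐ"
PLAIN_ROW = "zyxwvutsrqponmlkjihgfedcba"
FLIP = str.maketrans(PLAIN_ROW, FLIPPED_ROW)

def upside_down(text):
    """Flip lowercase letters and reverse the order of the characters."""
    return text.lower().translate(FLIP)[::-1]

print(upside_down("have fun"))
```

Characters without an entry in the table (spaces, digits, punctuation) pass
through unchanged, which is why the hand-made phrases above also swap
punctuation such as '!' for '¡' separately.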
Converters...
http://www.upsidedowntext.com/
https://fsymbols.com/generators/aboqe-flip/
https://www.fileformat.info/convert/text/upside-down.htm
https://www.fileformat.info/convert/text/upside-down-map.htm
Other types of text substitution converters....
https://fsymbols.com/generators/wavy
https://fsymbols.com/generators/zalgo
https://fsymbols.com/generators/carty/
https://fsymbols.com/generators/smallcaps/
...
-------------------------------------------------------------------------------
Unicode Block Tables...
Tolkien Runes (Runic block) U+16A0
ᚠ ᚡ ᚢ ᚣ ᚤ ᚥ ᚦ ᚧ ᚨ ᚩ ᚪ ᚫ ᚬ ᚭ ᚮ ᚯ
ᚰ ᚱ ᚲ ᚳ ᚴ ᚵ ᚶ ᚷ ᚸ ᚹ ᚺ ᚻ ᚼ ᚽ ᚾ ᚿ
ᛀ ᛁ ᛂ ᛃ ᛄ ᛅ ᛆ ᛇ ᛈ ᛉ ᛊ ᛋ ᛌ ᛍ ᛎ ᛏ
ᛐ ᛑ ᛒ ᛓ ᛔ ᛕ ᛖ ᛗ ᛘ ᛙ ᛚ ᛛ ᛜ ᛝ ᛞ ᛟ
ᛠ ᛡ ᛢ ᛣ ᛤ ᛥ ᛦ ᛧ ᛨ ᛩ ᛪ ᛫ ᛬ ᛭ ᛮ ᛯ
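Tables in this 16-per-row layout do not need to be typed by hand. The sketch
below regenerates the Runic table above from its code point range; the
function name "block_table" is an illustrative assumption, and unassigned code
points (category "Cn" in Python's unicodedata) are shown as a blank cell.

```python
import unicodedata

def block_table(start, end, columns=16):
    """Return rows of space-separated characters for start..end inclusive."""
    rows = []
    for row_start in range(start, end + 1, columns):
        cells = []
        for cp in range(row_start, min(row_start + columns, end + 1)):
            ch = chr(cp)
            # Leave a blank where the code point has no assigned character.
            cells.append(ch if unicodedata.category(ch) != "Cn" else " ")
        rows.append(" ".join(cells))
    return rows

for row in block_table(0x16A0, 0x16EF):
    print(row)
```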
Punctuation U+2000
‐ ‑ ‒ – — ― ‖ ‗ ‘ ’ ‚ ‛ “ ” „ ‟
† ‡ • ‣ ․ ‥ … ‧
‰ ‱ ′ ″ ‴ ‵ ‶ ‷ ‸ ‹ › ※ ‼ ‽ ‾ ‿
⁀ ⁁ ⁂ ⁃ ⁄ ⁅ ⁆ ⁇ ⁈ ⁉ ⁊ ⁋ ⁌ ⁍ ⁎ ⁏
⁐ ⁑ ⁒ ⁗
Arrows U+2190
← ↑ → ↓ ↔ ↕ ↖ ↗ ↘ ↙ ↚ ↛ ↜ ↝ ↞ ↟
↠ ↡ ↢ ↣ ↤ ↥ ↦ ↧ ↨ ↩ ↪ ↫ ↬ ↭ ↮ ↯
↰ ↱ ↲ ↳ ↴ ↵ ↶ ↷ ↸ ↹ ↺ ↻ ↼ ↽ ↾ ↿
⇀ ⇁ ⇂ ⇃ ⇄ ⇅ ⇆ ⇇ ⇈ ⇉ ⇊ ⇋ ⇌ ⇍ ⇎ ⇏
⇐ ⇑ ⇒ ⇓ ⇔ ⇕ ⇖ ⇗ ⇘ ⇙ ⇚ ⇛ ⇜ ⇝ ⇞ ⇟
⇠ ⇡ ⇢ ⇣ ⇤ ⇥ ⇦ ⇧ ⇨ ⇩ ⇪ ⇫ ⇬ ⇭ ⇮ ⇯
⇰ ⇱ ⇲ ⇳ ⇴ ⇵ ⇶ ⇷ ⇸ ⇹ ⇺ ⇻ ⇼ ⇽ ⇾ ⇿
Dingbat Arrows (U+2790)
➔ ➘ ➙ ➚ ➛ ➜ ➝ ➞ ➟ ➠ ➡ ➢ ➣
➤ ➥ ➦ ➧ ➨ ➩ ➪ ➫ ➬ ➭ ➮ ➯ ➱
➳ ➴ ➵ ➶ ➷ ➸ ➹ ➺ ➻ ➼ ➽ ➾ ➿
Supplement-A (U+27F0)
⟰ ⟱ ⟲ ⟳ ⟴ ⟵ ⟶ ⟷ ⟸ ⟹ ⟺ ⟻ ⟼ ⟽ ⟾ ⟿
Supplement-B (U+2900, not in X fonts)
⤀ ⤁ ⤂ ⤃ ⤄ ⤅ ⤆ ⤇ ⤈ ⤉ ⤊ ⤋ ⤌ ⤍ ⤎ ⤏
⤐ ⤑ ⤒ ⤓ ⤔ ⤕ ⤖ ⤗ ⤘ ⤙ ⤚ ⤛ ⤜ ⤝ ⤞ ⤟
⤠ ⤡ ⤢ ⤣ ⤤ ⤥ ⤦ ⤧ ⤨ ⤩ ⤪ ⤫ ⤬ ⤭ ⤮ ⤯
⤰ ⤱ ⤲ ⤳ ⤴ ⤵ ⤶ ⤷ ⤸ ⤹ ⤺ ⤻ ⤼ ⤽ ⤾ ⤿
⥀ ⥁ ⥂ ⥃ ⥄ ⥅ ⥆ ⥇ ⥈ ⥉ ⥊ ⥋ ⥌ ⥍ ⥎ ⥏
⥐ ⥑ ⥒ ⥓ ⥔ ⥕ ⥖ ⥗ ⥘ ⥙ ⥚ ⥛ ⥜ ⥝ ⥞ ⥟
⥠ ⥡ ⥢ ⥣ ⥤ ⥥ ⥦ ⥧ ⥨ ⥩ ⥪ ⥫ ⥬ ⥭ ⥮ ⥯
⥰ ⥱ ⥲ ⥳ ⥴ ⥵ ⥶ ⥷ ⥸ ⥹ ⥺ ⥻ ⥼ ⥽ ⥾ ⥿
Arrows from other sets...
ᛎ ᛏ ↩ ↪ ↫ ↬ ⏎ ≺ ≻
◄ ► ◅ ▻ ◂ ▸ ◃ ▹ ⊲ ⊳ ⊴ ⊵ ◀ ▶ ◁ ▷ ▲ ▼ △ ▽
Diacritical Arrows (see Composing Characters below)
⃐ ⃑ ⃔ ⃕ ⃖ ⃗ ⃡ ⃪ | Protect from end of line space removal
See combining chars below
Mathematical U+2200:
∀ ∁ ∂ ∃ ∄ ∅ ∆ ∇ ∈ ∉ ∊ ∋ ∌ ∍ ∎ ∏
∐ ∑ − ∓ ∔ ∕ ∖ ∗ ∘ ∙ √ ∛ ∜ ∝ ∞ ∟
∠ ∡ ∢ ∣ ∤ ∥ ∦ ∧ ∨ ∩ ∪ ∫ ∬ ∭ ∮ ∯
∰ ∱ ∲ ∳ ∴ ∵ ∶ ∷ ∸ ∹ ∺ ∻ ∼ ∽ ∾ ∿
≀ ≁ ≂ ≃ ≄ ≅ ≆ ≇ ≈ ≉ ≊ ≋ ≌ ≍ ≎ ≏
≐ ≑ ≒ ≓ ≔ ≕ ≖ ≗ ≘ ≙ ≚ ≛ ≜ ≝ ≞ ≟
≠ ≡ ≢ ≣ ≤ ≥ ≦ ≧ ≨ ≩ ≪ ≫ ≬ ≭ ≮ ≯
≰ ≱ ≲ ≳ ≴ ≵ ≶ ≷ ≸ ≹ ≺ ≻ ≼ ≽ ≾ ≿
⊀ ⊁ ⊂ ⊃ ⊄ ⊅ ⊆ ⊇ ⊈ ⊉ ⊊ ⊋ ⊌ ⊍ ⊎ ⊏
⊐ ⊑ ⊒ ⊓ ⊔ ⊕ ⊖ ⊗ ⊘ ⊙ ⊚ ⊛ ⊜ ⊝ ⊞ ⊟
⊠ ⊡ ⊢ ⊣ ⊤ ⊥ ⊦ ⊧ ⊨ ⊩ ⊪ ⊫ ⊬ ⊭ ⊮ ⊯
⊰ ⊱ ⊲ ⊳ ⊴ ⊵ ⊶ ⊷ ⊸ ⊹ ⊺ ⊻ ⊼ ⊽ ⊾ ⊿
⋀ ⋁ ⋂ ⋃ ⋄ ⋅ ⋆ ⋇ ⋈ ⋉ ⋊ ⋋ ⋌ ⋍ ⋎ ⋏
⋐ ⋑ ⋒ ⋓ ⋔ ⋕ ⋖ ⋗ ⋘ ⋙ ⋚ ⋛ ⋜ ⋝ ⋞ ⋟
⋠ ⋡ ⋢ ⋣ ⋤ ⋥ ⋦ ⋧ ⋨ ⋩ ⋪ ⋫ ⋬ ⋭ ⋮ ⋯
⋰ ⋱ ⋲ ⋳ ⋴ ⋵ ⋶ ⋷ ⋸ ⋹ ⋺ ⋻ ⋼ ⋽ ⋾ ⋿
Math Supplemental U+2A00
⨀ ⨁ ⨂ ⨃ ⨄ ⨅ ⨆ ⨇ ⨈ ⨉ ⨊ ⨋ ⨌ ⨍ ⨎ ⨏
⨐ ⨑ ⨒ ⨓ ⨔ ⨕ ⨖ ⨗ ⨘ ⨙ ⨚ ⨛ ⨜ ⨝
Technical U+2300:
⌀ ⌁ ⌂ ⌃ ⌄ ⌅ ⌆ ⌇ ⌈ ⌉ ⌊ ⌋ ⌌ ⌍ ⌎ ⌏
⌐ ⌑ ⌒ ⌓ ⌔ ⌕ ⌖ ⌗ ⌘ ⌙ ⌚ ⌛ ⌜ ⌝ ⌞ ⌟
⌠ ⌡ ⌢ ⌣ ⌤ ⌥ ⌦ ⌧ ⌨ 〈 〉 ⌫ ⌬ ⌭ ⌮ ⌯
⌰ ⌱ ⌲ ⌳ ⌴ ⌵ ⌶ ⌷ ⌸ ⌹ ⌺ ⌻ ⌼ ⌽ ⌾ ⌿
⍀ ⍁ ⍂ ⍃ ⍄ ⍅ ⍆ ⍇ ⍈ ⍉ ⍊ ⍋ ⍌ ⍍ ⍎ ⍏
⍐ ⍑ ⍒ ⍓ ⍔ ⍕ ⍖ ⍗ ⍘ ⍙ ⍚ ⍛ ⍜ ⍝ ⍞ ⍟
⍠ ⍡ ⍢ ⍣ ⍤ ⍥ ⍦ ⍧ ⍨ ⍩ ⍪ ⍫ ⍬ ⍭ ⍮ ⍯
⍰ ⍱ ⍲ ⍳ ⍴ ⍵ ⍶ ⍷ ⍸ ⍹ ⍺ ⍻ ⍼ ⍽ ⍾ ⍿
⎀ ⎁ ⎂ ⎃ ⎄ ⎅ ⎆ ⎇ ⎈ ⎉ ⎊ ⎋ ⎌ ⎍ ⎎ ⎏
⎐ ⎑ ⎒ ⎓ ⎔ ⎕ ⎖ ⎗ ⎘ ⎙ ⎚ ⎛ ⎜ ⎝ ⎞ ⎟
⎠ ⎡ ⎢ ⎣ ⎤ ⎥ ⎦ ⎧ ⎨ ⎩ ⎪ ⎫ ⎬ ⎭ ⎮ ⎯
⎰ ⎱ ⎲ ⎳ ⎴ ⎵ ⎶ ⎷ ⎸ ⎹ ⎺ ⎻ ⎼ ⎽ ⎾ ⎿
⏀ ⏁ ⏂ ⏃ ⏄ ⏅ ⏆ ⏇ ⏈ ⏉ ⏊ ⏋ ⏌ ⏍ ⏎ ⏏
⏐ ⏑ ⏒ ⏓ ⏔ ⏕ ⏖ ⏗ ⏘ ⏙ ⏚ ⏛ ⏜ ⏝ ⏞ ⏟
⏠ ⏡ ⏢ ⏣ ⏤ ⏥ ⏦ ⏧ ⏨ ⏩ ⏪ ⏫ ⏬ ⏭ ⏮ ⏯
⏰ ⏱ ⏲ ⏳ ⏴ ⏵ ⏶ ⏷ ⏸ ⏹ ⏺ ⏻ ⏼ ⏽ ⏾ ⏿
Miscellaneous U+2400:
␀ ␁ ␂ ␃ ␄ ␅ ␆ ␇ ␈ ␉ ␊ ␋ ␌ ␍ ␎ ␏
␐ ␑ ␒ ␓ ␔ ␕ ␖ ␗ ␘ ␙ ␚ ␛ ␜ ␝ ␞ ␟
␠ ␡ ␢ ␣ ␤ ␥ ␦
⑀ ⑁ ⑂ ⑃ ⑄ ⑅ ⑆ ⑇ ⑈ ⑉ ⑊
① ② ③ ④ ⑤ ⑥ ⑦ ⑧ ⑨ ⑩ ⑪ ⑫ ⑬ ⑭ ⑮ ⑯
⑰ ⑱ ⑲ ⑳ ⑴ ⑵ ⑶ ⑷ ⑸ ⑹ ⑺ ⑻ ⑼ ⑽ ⑾ ⑿
⒀ ⒁ ⒂ ⒃ ⒄ ⒅ ⒆ ⒇ ⒈ ⒉ ⒊ ⒋ ⒌ ⒍ ⒎ ⒏
⒐ ⒑ ⒒ ⒓ ⒔ ⒕ ⒖ ⒗ ⒘ ⒙ ⒚ ⒛ ⒜ ⒝ ⒞ ⒟
⒠ ⒡ ⒢ ⒣ ⒤ ⒥ ⒦ ⒧ ⒨ ⒩ ⒪ ⒫ ⒬ ⒭ ⒮ ⒯
⒰ ⒱ ⒲ ⒳ ⒴ ⒵ Ⓐ Ⓑ Ⓒ Ⓓ Ⓔ Ⓕ Ⓖ Ⓗ Ⓘ Ⓙ
Ⓚ Ⓛ Ⓜ Ⓝ Ⓞ Ⓟ Ⓠ Ⓡ Ⓢ Ⓣ Ⓤ Ⓥ Ⓦ Ⓧ Ⓨ Ⓩ
ⓐ ⓑ ⓒ ⓓ ⓔ ⓕ ⓖ ⓗ ⓘ ⓙ ⓚ ⓛ ⓜ ⓝ ⓞ ⓟ
ⓠ ⓡ ⓢ ⓣ ⓤ ⓥ ⓦ ⓧ ⓨ ⓩ ⓪ ⓫ ⓬ ⓭ ⓮ ⓯
⓰ ⓱ ⓲ ⓳ ⓴ ⓵ ⓶ ⓷ ⓸ ⓹ ⓺ ⓻ ⓼ ⓽ ⓾ ⓿
Graphics U+2500:
─ ━ │ ┃ ┄ ┅ ┆ ┇ ┈ ┉ ┊ ┋ ┌ ┍ ┎ ┏
┐ ┑ ┒ ┓ └ ┕ ┖ ┗ ┘ ┙ ┚ ┛ ├ ┝ ┞ ┟
┠ ┡ ┢ ┣ ┤ ┥ ┦ ┧ ┨ ┩ ┪ ┫ ┬ ┭ ┮ ┯
┰ ┱ ┲ ┳ ┴ ┵ ┶ ┷ ┸ ┹ ┺ ┻ ┼ ┽ ┾ ┿
╀ ╁ ╂ ╃ ╄ ╅ ╆ ╇ ╈ ╉ ╊ ╋ ╌ ╍ ╎ ╏
═ ║ ╒ ╓ ╔ ╕ ╖ ╗ ╘ ╙ ╚ ╛ ╜ ╝ ╞ ╟
╠ ╡ ╢ ╣ ╤ ╥ ╦ ╧ ╨ ╩ ╪ ╫ ╬ ╭ ╮ ╯
╰ ╱ ╲ ╳ ╴ ╵ ╶ ╷ ╸ ╹ ╺ ╻ ╼ ╽ ╾ ╿
▀ ▁ ▂ ▃ ▄ ▅ ▆ ▇ █ ▉ ▊ ▋ ▌ ▍ ▎ ▏
▐ ░ ▒ ▓ ▔ ▕ ▖ ▗ ▘ ▙ ▚ ▛ ▜ ▝ ▞ ▟
■ □ ▢ ▣ ▤ ▥ ▦ ▧ ▨ ▩ ▪ ▫ ▬ ▭ ▮ ▯
▰ ▱ ▲ △ ▴ ▵ ▶ ▷ ▸ ▹ ► ▻ ▼ ▽ ▾ ▿
◀ ◁ ◂ ◃ ◄ ◅ ◆ ◇ ◈ ◉ ◊ ○ ◌ ◍ ◎ ●
◐ ◑ ◒ ◓ ◔ ◕ ◖ ◗ ◘ ◙ ◚ ◛ ◜ ◝ ◞ ◟
◠ ◡ ◢ ◣ ◤ ◥ ◦ ◧ ◨ ◩ ◪ ◫ ◬ ◭ ◮ ◯
◰ ◱ ◲ ◳ ◴ ◵ ◶ ◷ ◸ ◹ ◺ ◻ ◼ ◽ ◾ ◿
Miscellaneous Symbols U+2600:
☀ ☁ ☂ ☃ ☄ ★ ☆ ☇ ☈ ☉ ☊ ☋ ☌ ☍ ☎ ☏
☐ ☑ ☒ ☓ ☔ ☕ ☖ ☗ ☘ ☙ ☚ ☛ ☜ ☝ ☞ ☟
☠ ☡ ☢ ☣ ☤ ☥ ☦ ☧ ☨ ☩ ☪ ☫ ☬ ☭ ☮ ☯
☰ ☱ ☲ ☳ ☴ ☵ ☶ ☷ ☸ ☹ ☺ ☻ ☼ ☽ ☾ ☿
♀ ♁ ♂ ♃ ♄ ♅ ♆ ♇ ♈ ♉ ♊ ♋ ♌ ♍ ♎ ♏
♐ ♑ ♒ ♓ ♔ ♕ ♖ ♗ ♘ ♙ ♚ ♛ ♜ ♝ ♞ ♟
♠ ♡ ♢ ♣ ♤ ♥ ♦ ♧ ♨ ♩ ♪ ♫ ♬ ♭ ♮ ♯
Dingbats U+2700:
Many of the original are defined elsewhere:
✁ ✂ ✃ ✄ ✆ ✇ ✈ ✉ ✌ ✍ ✎ ✏
✐ ✑ ✒ ✓ ✔ ✕ ✖ ✗ ✘ ✙ ✚ ✛ ✜ ✝ ✞ ✟
✠ ✡ ✢ ✣ ✤ ✥ ✦ ✧ ✩ ✪ ✫ ✬ ✭ ✮ ✯
✰ ✱ ✲ ✳ ✴ ✵ ✶ ✷ ✸ ✹ ✺ ✻ ✼ ✽ ✾ ✿
❀ ❁ ❂ ❃ ❄ ❅ ❆ ❇ ❈ ❉ ❊ ❋ ❍ ❏
❐ ❑ ❒ ❖ ❘ ❙ ❚ ❛ ❜ ❝ ❞
❡ ❢ ❣ ❤ ❥ ❦ ❧ ❨ ❩ ❪ ❫ ❬ ❭ ❮ ❯
❰ ❱ ❲ ❳ ❴ ❵ ❶ ❷ ❸ ❹ ❺ ❻ ❼ ❽ ❾ ❿
➀ ➁ ➂ ➃ ➄ ➅ ➆ ➇ ➈ ➉ ➊ ➋ ➌ ➍ ➎ ➏
➐ ➑ ➒ ➓ ➔ ➘ ➙ ➚ ➛ ➜ ➝ ➞ ➟
➠ ➡ ➢ ➣ ➤ ➥ ➦ ➧ ➨ ➩ ➪ ➫ ➬ ➭ ➮ ➯
➱ ➲ ➳ ➴ ➵ ➶ ➷ ➸ ➹ ➺ ➻ ➼ ➽ ➾ ➿
⟀ ⟁ ⟂ ⟃ ⟄ ⟅ ⟆ ⟇ ⟈ ⟉ ⟊ ⟋ ⟌ ⟍ ⟎ ⟏
⟐ ⟑ ⟒ ⟓ ⟔ ⟕ ⟖ ⟗ ⟘ ⟙ ⟚ ⟛ ⟜ ⟝ ⟞ ⟟
⟠ ⟡ ⟢ ⟣ ⟤ ⟥ ⟦ ⟧ ⟨ ⟩ ⟪ ⟫ ⟬ ⟭ ⟮ ⟯
⟰ ⟱ ⟲ ⟳ ⟴ ⟵ ⟶ ⟷ ⟸ ⟹ ⟺ ⟻ ⟼ ⟽ ⟾ ⟿
Braille U+2800:
⠀ ⠁ ⠂ ⠃ ⠄ ⠅ ⠆ ⠇ ⠈ ⠉ ⠊ ⠋ ⠌ ⠍ ⠎ ⠏
⠐ ⠑ ⠒ ⠓ ⠔ ⠕ ⠖ ⠗ ⠘ ⠙ ⠚ ⠛ ⠜ ⠝ ⠞ ⠟
⠠ ⠡ ⠢ ⠣ ⠤ ⠥ ⠦ ⠧ ⠨ ⠩ ⠪ ⠫ ⠬ ⠭ ⠮ ⠯
⠰ ⠱ ⠲ ⠳ ⠴ ⠵ ⠶ ⠷ ⠸ ⠹ ⠺ ⠻ ⠼ ⠽ ⠾ ⠿
⡀ ⡁ ⡂ ⡃ ⡄ ⡅ ⡆ ⡇ ⡈ ⡉ ⡊ ⡋ ⡌ ⡍ ⡎ ⡏
⡐ ⡑ ⡒ ⡓ ⡔ ⡕ ⡖ ⡗ ⡘ ⡙ ⡚ ⡛ ⡜ ⡝ ⡞ ⡟
⡠ ⡡ ⡢ ⡣ ⡤ ⡥ ⡦ ⡧ ⡨ ⡩ ⡪ ⡫ ⡬ ⡭ ⡮ ⡯
⡰ ⡱ ⡲ ⡳ ⡴ ⡵ ⡶ ⡷ ⡸ ⡹ ⡺ ⡻ ⡼ ⡽ ⡾ ⡿
⢀ ⢁ ⢂ ⢃ ⢄ ⢅ ⢆ ⢇ ⢈ ⢉ ⢊ ⢋ ⢌ ⢍ ⢎ ⢏
⢐ ⢑ ⢒ ⢓ ⢔ ⢕ ⢖ ⢗ ⢘ ⢙ ⢚ ⢛ ⢜ ⢝ ⢞ ⢟
⢠ ⢡ ⢢ ⢣ ⢤ ⢥ ⢦ ⢧ ⢨ ⢩ ⢪ ⢫ ⢬ ⢭ ⢮ ⢯
⢰ ⢱ ⢲ ⢳ ⢴ ⢵ ⢶ ⢷ ⢸ ⢹ ⢺ ⢻ ⢼ ⢽ ⢾ ⢿
⣀ ⣁ ⣂ ⣃ ⣄ ⣅ ⣆ ⣇ ⣈ ⣉ ⣊ ⣋ ⣌ ⣍ ⣎ ⣏
⣐ ⣑ ⣒ ⣓ ⣔ ⣕ ⣖ ⣗ ⣘ ⣙ ⣚ ⣛ ⣜ ⣝ ⣞ ⣟
⣠ ⣡ ⣢ ⣣ ⣤ ⣥ ⣦ ⣧ ⣨ ⣩ ⣪ ⣫ ⣬ ⣭ ⣮ ⣯
⣰ ⣱ ⣲ ⣳ ⣴ ⣵ ⣶ ⣷ ⣸ ⣹ ⣺ ⣻ ⣼ ⣽ ⣾ ⣿
Character Code (in hex) =
   U+2800 +  1   8
             2  10
             4  20
            40  80
so the lower four dots are (in hex) =
   U+2800 + 40 + 80 + 4 + 20 => U+28E4 => ⣤
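The same arithmetic can be written as a few lines of code. This is a small
sketch of the dot-value table above; the name "braille_cell" and the
four-strings input format ('#' for a raised dot, '.' for empty) are
assumptions chosen for readability.

```python
# Bit values for the 2x4 dot grid, row by row (left dot, right dot),
# exactly as laid out in the table above.
DOT_BITS = [
    (0x01, 0x08),
    (0x02, 0x10),
    (0x04, 0x20),
    (0x40, 0x80),
]

def braille_cell(rows):
    """Build one braille character from four two-character strings.

    Any character other than '.' or ' ' counts as a raised dot.
    """
    code = 0x2800
    for (left_bit, right_bit), row in zip(DOT_BITS, rows):
        padded = row.ljust(2)
        if padded[0] not in ". ":
            code |= left_bit
        if padded[1] not in ". ":
            code |= right_bit
    return chr(code)

lower_four = braille_cell(["..", "..", "##", "##"])
print(lower_four, f"U+{ord(lower_four):04X}")
```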
Miscellaneous X window defined glyphs
               
         
               
               
               
ff fi fl ffi ffl ſt st ﬓ ﬔ ﬕ ﬖ ﬗ יִ ﬞ ײַ
ﬠ ﬡ ﬢ ﬣ ﬤ ﬥ ﬦ ﬧ ﬨ ﬩ שׁ שׂ שּׁ שּׂ אַ אָ
אּ בּ גּ דּ הּ וּ זּ טּ יּ ךּ כּ לּ מּ
נּ סּ ףּ פּ צּ קּ רּ שּ תּ וֹ בֿ כֿ פֿ ﭏ
ﭖ ﭗ ﭘ ﭙ ﭪ ﭫ ﭬ ﭭ ﭺ ﭻ ﭼ ﭽ
ﮆ ﮇ ﮎ ﮏ ﮐ ﮑ ﮒ ﮓ ﮔ ﮕ ﯼ ﯽ ﯾ ﯿ
ﹰ ﹲ ﹴ ﹶ ﹸ ﹺ ﹼ ﹽ ﹾ
ﺀ ﺁ ﺂ ﺃ ﺄ ﺅ ﺆ ﺇ ﺈ ﺉ ﺊ ﺋ ﺌ ﺍ ﺎ ﺏ
ﺐ ﺑ ﺒ ﺓ ﺔ ﺕ ﺖ ﺗ ﺘ ﺙ ﺚ ﺛ ﺜ ﺝ ﺞ ﺟ
ﺠ ﺡ ﺢ ﺣ ﺤ ﺥ ﺦ ﺧ ﺨ ﺩ ﺪ ﺫ ﺬ ﺭ ﺮ ﺯ
ﺰ ﺱ ﺲ ﺳ ﺴ ﺵ ﺶ ﺷ ﺸ ﺹ ﺺ ﺻ ﺼ ﺽ ﺾ ﺿ
ﻀ ﻁ ﻂ ﻃ ﻄ ﻅ ﻆ ﻇ ﻈ ﻉ ﻊ ﻋ ﻌ ﻍ ﻎ ﻏ
ﻐ ﻑ ﻒ ﻓ ﻔ ﻕ ﻖ ﻗ ﻘ ﻙ ﻚ ﻛ ﻜ ﻝ ﻞ ﻟ
ﻠ ﻡ ﻢ ﻣ ﻤ ﻥ ﻦ ﻧ ﻨ ﻩ ﻪ ﻫ ﻬ ﻭ ﻮ ﻯ
ﻰ ﻱ ﻲ ﻳ ﻴ ﻵ ﻶ ﻷ ﻸ ﻹ ﻺ ﻻ ﻼ
Greek Alphabet
Α Β Γ Δ Ε Π Τ
α β γ δ ε π τ
Specials Block U+FFF0:
 \uFFF9 Annotation Anchor
 \uFFFA Annotation Separator
 \uFFFB Annotation Terminator
 \uFFFC Replacement Object (placeholder for unspecified document)
� \uFFFD Replacement character (the official not-defined character)
￾ ￿ \uFFFE, \uFFFF not a character (generally something is wrong!)
The most important character in this block is � \uFFFD
It is rendered as a filled diamond with a question mark, and is
used to indicate a problem within the Unicode stream,
such as displaying a Windows code page as Unicode.
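A quick way to see where � comes from is to decode bytes with the wrong
codec. The sketch below encodes a word in Windows code page 1252 and then
decodes it as UTF-8, asking Python to substitute U+FFFD for anything invalid.
The sample word is arbitrary.

```python
raw = "caf\u00e9".encode("cp1252")   # the e-acute becomes the single byte 0xE9

# 0xE9 alone is not a valid UTF-8 sequence, so it is replaced by U+FFFD.
wrong = raw.decode("utf-8", errors="replace")
right = raw.decode("cp1252")

print(repr(raw))
print(wrong, [f"U+{ord(ch):04X}" for ch in wrong])
print(right)
```

Once the replacement has happened the original byte is gone, so the fix has
to be applied where the bytes are decoded, not afterwards.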
-------------------------------------------------------------------------------
Composing Characters...
Diacritical Marks are characters that accent the previous character.
Generally you have a main character then a combining character which overlays
the previous character. Some characters are pre-combined to provide
direct compatibility with the older ISO8859 fonts.
A + Diaeresis (u0308): Ä
PreCombined (u00C4): Ä
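The two forms above look identical but are different code point sequences, so
a plain string comparison treats them as unequal. A short sketch using
Python's standard "unicodedata.normalize" shows how to convert between them
(NFC composes, NFD decomposes):

```python
import unicodedata

decomposed = "A\u0308"    # LATIN CAPITAL LETTER A + COMBINING DIAERESIS
precombined = "\u00C4"    # LATIN CAPITAL LETTER A WITH DIAERESIS

print(decomposed == precombined)                                  # raw compare
print(unicodedata.normalize("NFC", decomposed) == precombined)    # composed
print(unicodedata.normalize("NFD", precombined) == decomposed)    # decomposed
print(len(decomposed), len(precombined))                          # code points
```

Combinations with no pre-combined form, such as the boxed symbols in the
examples below, stay as two code points under either normalization.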
Combining Characters tend to fail in unexpected ways, with marks appearing
over the next character (Chrome), or not centered over/below the previous
character. XTerms seem to work the best.
Note that the Thai Script needs up to two combining characters
over a single base character.
Examples...
STARGΛ̊TE SG-1, a = v̇ = r̈, a⃑ ⊥ b⃑
.⃝ ⋅⃝ ∘⃝ •⃝ •⃟ •⃞ ▫⃞ X⃞ ╳⃞ ✔⃞ v⃤ •⃕ ∘͎ ∘̭̌ •̬̂ ◇̬̂ °⃘̊
Diacritical Mark Blocks (formatted over a space)
Combining Diacritical Marks U+0300 - U+036F
̃ ̄ ̅ ̂ ̌ ̑ ̆ ̐ ̀ ́ ̇ ̈ ͡ ̚ ̊ | Protect from end of line space removal
̴ ̵ ̶ ̷ ̸ | as these are all combined with a space!
̱ ̲ ̭ ̬ ̯ ̮ ͎ ͢ ͜ |
Non-combining Diacritical Marks U+02B0 - U+02FF
˘ ˙ ˚ ˜ ˟ ˆ ˇ
Combining Diacritical Marks for Symbols U+20D0 - U+20FF
⃐ ⃑ ⃒ ⃓ ⃔ ⃕ ⃖ ⃗ ⃘ ⃙ ⃚ ⃛ ⃜ | Protect from end of line space removal
⃝ ⃞ ⃟ ⃠ ⃡ ⃢ ⃣ ⃤ ⃥ ⃦ ⃧ ⃨ ⃩ ⃪ | Works in "vim" but in little else
Variation Selector...
When fonts contain both Text and Emoji variants, some symbols are in both.
Each symbol generally has a default for how it should be displayed,
but a variation selector placed after the symbol can change that default when
this is a problem. This is generally needed for web rendering; in terminals the
selector is not understood at this time and comes out as an unknown
composing character.
U+FE0E indicator for text rendering
U+FE0F indicator for emoji rendering
Example... ↔ is a math symbol.
But some web browsers will prefer to use the emoji variant! That means some
mathematical formulas simply do not render as originally intended.
See http://xahlee.info/comp/text_vs_emoji.html
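In a string, the selector is simply an extra code point following the symbol
it applies to. A minimal sketch (the helper name "with_presentation" is made
up for illustration; whether the request is honored depends entirely on the
renderer and font):

```python
SELECTORS = {
    "text": "\uFE0E",    # VARIATION SELECTOR-15, request text rendering
    "emoji": "\uFE0F",   # VARIATION SELECTOR-16, request emoji rendering
}

def with_presentation(symbol, style):
    """Append the variation selector for 'text' or 'emoji' to one symbol."""
    return symbol + SELECTORS[style]

arrow = "\u2194"  # LEFT RIGHT ARROW
text_arrow = with_presentation(arrow, "text")
print([f"U+{ord(ch):04X}" for ch in text_arrow])
```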
-----------------------------------------------------------------------------
Simple ASCII Art...
These often make big use of Diacritical Marks and as such often fail
in spectacular ways.
(°͡ʖ͜°͡) "Lenny Face" XTerm works.
( ͡° ͜ʖ ͡°) For GTK fonts and web browsers (proportional fonts)
(⟃ ͜ʖ ⟄) Googly Eyes
¯\_(ツ)_/¯ The double-wide face fails in XTerms
ӽd̲̅a̲̅r̲̅w̲̅i̲̅ɳ̲̅ᕗ ӽe̲̅v̲̅o̲̅l̲̅u̲̅t̲̅i̲̅o̲̅ɳ̲̅ᕗ Darwin evolution fish
┌∩┐(◣_◢)┌∩┐ Monster
(´סּ︵סּ`) Sad
(¬_¬) Meh
ლ(ಠ益ಠ)ლ Angry
•͡˘㇁•͡˘ Face (not in xterm, gedit, or chrome)
(◔/‿\◔) Up Face
Circle of Hats (chrome works, not in xterm) - lost
__̴ı̴̴̡̡̡ ̡͌l̡̡̡ ̡͌l̡*̡̡ ̴̡ı̴̴̡ ̡̡͡|̲̲̲͡͡͡ ̲▫̲͡ ̲̲̲͡͡π̲̲͡͡ ̲̲͡▫̲̲͡͡ ̲|̡̡̡ ı̴̡̡ ̡͌l̡̡̡̡.___
Landscape (XTerms only, not Gedit or chrome)
»-(¯`.´¯)-> Arrow in Heart
[{-_-}] ZZZzz zz z... Sleep
龴ↀ◡ↀ龴 Cat
ʕ•ᴥ•ʔ Bear
ᶘ ᵒᴥᵒᶅ ᶘᵒᴥᵒᶅ pedobear (look right, left)
(♥_♥) Love Eyes
⊂(✰‿✰)つ Star Eyes
\(סּںסּَ` )/ۜ Yea!
(Ͼ˳Ͽ) big eyes
∙،°. ˘Ô≈ôﺣ Racing Car
ℓ٥ﻻ ﻉ√٥υ Love You Script
[̲̅$̲̅(̲̅ιοο̲̅)̲̅$̲̅] Money (xterm only)
~(‾▿‾)~ Bird
-`ღ´- Sparkling heart
︻╦̵̵͇̿̿̿̿══╤─ Rifle
┌ಠ_ಠ)┌∩┐ ᶠᶸᶜᵏ♥ᵧₒᵤ Fuck You (chrome, not Xterms)
-----------------------------------------------------
Language Examples...
APL:
((V⍳V)=⍳⍴V)/V←,V ⌷←⍳→⍴∆∇⊃‾⍎⍕⌈
Linguistics and dictionaries:
ði ıntəˈnæʃənəl fəˈnɛtık əsoʊsiˈeıʃn
Y [ˈʏpsilɔn], Yen [jɛn], Yoga [ˈjoːgɑ]
Some Chinese (double width characters)
测试用的汉字
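Double-width characters are the reason a character count is not a column
count. The sketch below estimates the number of terminal columns a string
needs, using the East Asian Width property from Python's "unicodedata". The
name "display_width" and the exact rules (wide and fullwidth count as 2;
combining marks and format characters count as 0) are simplifying
assumptions; real terminals use their own width tables and may disagree.

```python
import unicodedata

def display_width(text):
    """Estimate how many fixed-width columns the text occupies."""
    width = 0
    for ch in text:
        if unicodedata.category(ch) in ("Mn", "Me", "Cf"):
            continue  # combining marks and format characters take no column
        # "W" (wide) and "F" (fullwidth) characters take two columns.
        width += 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
    return width

for sample in ("abc", "\u6c49\u5b57", "A\u0308"):
    print(repr(sample), len(sample), display_width(sample))
```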
Greek (in Polytonic):
From a speech of Demosthenes in the 4th century BC:
Οὐχὶ ταὐτὰ παρίσταταί μοι γιγνώσκειν, ὦ ἄνδρες ᾿Αθηναῖοι,
ὅταν τ᾿ εἰς τὰ πράγματα ἀποβλέψω καὶ ὅταν πρὸς τοὺς
λόγους οὓς ἀκούω· τοὺς μὲν γὰρ λόγους περὶ τοῦ
τιμωρήσασθαι Φίλιππον ὁρῶ γιγνομένους, τὰ δὲ πράγματ᾿
εἰς τοῦτο προήκοντα, ὥσθ᾿ ὅπως μὴ πεισόμεθ᾿ αὐτοὶ
πρότερον κακῶς σκέψασθαι δέον. οὐδέν οὖν ἄλλο μοι δοκοῦσιν
οἱ τὰ τοιαῦτα λέγοντες ἢ τὴν ὑπόθεσιν, περὶ ἧς βουλεύεσθαι,
οὐχὶ τὴν οὖσαν παριστάντες ὑμῖν ἁμαρτάνειν. ἐγὼ δέ, ὅτι μέν
ποτ᾿ ἐξῆν τῇ πόλει καὶ τὰ αὑτῆς ἔχειν ἀσφαλῶς καὶ Φίλιππον
τιμωρήσασθαι, καὶ μάλ᾿ ἀκριβῶς οἶδα· ἐπ᾿ ἐμοῦ γάρ, οὐ πάλαι
γέγονεν ταῦτ᾿ ἀμφότερα· νῦν μέντοι πέπεισμαι τοῦθ᾿ ἱκανὸν
προλαβεῖν ἡμῖν εἶναι τὴν πρώτην, ὅπως τοὺς συμμάχους
σώσομεν. ἐὰν γὰρ τοῦτο βεβαίως ὑπάρξῃ, τότε καὶ περὶ τοῦ
τίνα τιμωρήσεταί τις καὶ ὃν τρόπον ἐξέσται σκοπεῖν· πρὶν δὲ
τὴν ἀρχὴν ὀρθῶς ὑποθέσθαι, μάταιον ἡγοῦμαι περὶ τῆς
τελευτῆς ὁντινοῦν ποιεῖσθαι λόγον.
Δημοσθένους, Γ´ ᾿Ολυνθιακὸς
Georgian:
From a Unicode conference invitation:
გთხოვთ ახლავე გაიაროთ რეგისტრაცია Unicode-ის მეათე საერთაშორისო
კონფერენციაზე დასასწრებად, რომელიც გაიმართება 10-12 მარტს,
ქ. მაინცში, გერმანიაში. კონფერენცია შეჰკრებს ერთად მსოფლიოს
ექსპერტებს ისეთ დარგებში როგორიცაა ინტერნეტი და Unicode-ი,
ინტერნაციონალიზაცია და ლოკალიზაცია, Unicode-ის გამოყენება
ოპერაციულ სისტემებსა, და გამოყენებით პროგრამებში, შრიფტებში,
ტექსტების დამუშავებასა და მრავალენოვან კომპიუტერულ სისტემებში.
Russian:
From a Unicode conference invitation:
Зарегистрируйтесь сейчас на Десятую Международную Конференцию по
Unicode, которая состоится 10-12 марта 1997 года в Майнце в Германии.
Конференция соберет широкий круг экспертов по вопросам глобального
Интернета и Unicode, локализации и интернационализации, воплощению и
применению Unicode в различных операционных системах и программных
приложениях, шрифтах, верстке и многоязычных компьютерных системах.
Thai (UCS Level 2):
Excerpt from a poem on The Romance of The Three Kingdoms
(a Chinese classic 'San Gua'):
[----------------------------|------------------------]
๏ แผ่นดินฮั่นเสื่อมโทรมแสนสังเวช พระปกเกศกองบู๊กู้ขึ้นใหม่
สิบสองกษัตริย์ก่อนหน้าแลถัดไป สององค์ไซร้โง่เขลาเบาปัญญา
ทรงนับถือขันทีเป็นที่พึ่ง บ้านเมืองจึงวิปริตเป็นนักหนา
โฮจิ๋นเรียกทัพทั่วหัวเมืองมา หมายจะฆ่ามดชั่วตัวสำคัญ
เหมือนขับไสไล่เสือจากเคหา รับหมาป่าเข้ามาเลยอาสัญ
ฝ่ายอ้องอุ้นยุแยกให้แตกกัน ใช้สาวนั้นเป็นชนวนชื่นชวนใจ
พลันลิฉุยกุยกีกลับก่อเหตุ ช่างอาเพศจริงหนาฟ้าร้องไห้
ต้องรบราฆ่าฟันจนบรรลัย ฤๅหาใครค้ำชูกู้บรรลังก์ ฯ
(The above is a two-column text. If combining characters are handled
correctly, the lines of the second column should be aligned with the
'|' character above.)
Ethiopian:
Proverbs in the Amharic language:
ሰማይ አይታረስ ንጉሥ አይከሰስ።
ብላ ካለኝ እንደአባቴ በቆመጠኝ።
ጌጥ ያለቤቱ ቁምጥና ነው።
ደሀ በሕልሙ ቅቤ ባይጠጣ ንጣት በገደለው።
የአፍ ወለምታ በቅቤ አይታሽም።
አይጥ በበላ ዳዋ ተመታ።
ሲተረጉሙ ይደረግሙ።
ቀስ በቀስ፥ ዕንቁላል በእግሩ ይሄዳል።
ድር ቢያብር አንበሳ ያስር።
ሰው እንደቤቱ እንጅ እንደ ጉረቤቱ አይተዳደርም።
እግዜር የከፈተውን ጉሮሮ ሳይዘጋው አይድርም።
የጎረቤት ሌባ፥ ቢያዩት ይስቅ ባያዩት ያጠልቅ።
ሥራ ከመፍታት ልጄን ላፋታት።
ዓባይ ማደሪያ የለው፥ ግንድ ይዞ ይዞራል።
የእስላም አገሩ መካ የአሞራ አገሩ ዋርካ።
ተንጋሎ ቢተፉ ተመልሶ ባፉ።
ወዳጅህ ማር ቢሆን ጨርስህ አትላሰው።
እግርህን በፍራሽህ ልክ ዘርጋ።
Runes:
ᚻᛖ ᚳᚹᚫᚦ ᚦᚫᛏ ᚻᛖ ᛒᚢᛞᛖ ᚩᚾ ᚦᚫᛗ ᛚᚪᚾᛞᛖ ᚾᚩᚱᚦᚹᛖᚪᚱᛞᚢᛗ ᚹᛁᚦ ᚦᚪ ᚹᛖᛥᚫ
(Old English, which transcribed into Latin reads
'He cwaeth that he bude thaem lande northweardum with tha Westsae.'
or translated to modern English
'He said that he lived in the northern land near the Western Sea.')
Braille:
⡌⠁⠧⠑ ⠼⠁⠒ ⡍⠜⠇⠑⠹⠰⠎ ⡣⠕⠌
⡍⠜⠇⠑⠹ ⠺⠁⠎ ⠙⠑⠁⠙⠒ ⠞⠕ ⠃⠑⠛⠔ ⠺⠊⠹⠲ ⡹⠻⠑ ⠊⠎ ⠝⠕ ⠙⠳⠃⠞
⠱⠁⠞⠑⠧⠻ ⠁⠃⠳⠞ ⠹⠁⠞⠲ ⡹⠑ ⠗⠑⠛⠊⠌⠻ ⠕⠋ ⠙⠊⠎ ⠃⠥⠗⠊⠁⠇ ⠺⠁⠎
⠎⠊⠛⠝⠫ ⠃⠹ ⠹⠑ ⠊⠇⠻⠛⠹⠍⠁⠝⠂ ⠹⠑ ⠊⠇⠻⠅⠂ ⠹⠑ ⠥⠝⠙⠻⠞⠁⠅⠻⠂
⠁⠝⠙ ⠹⠑ ⠡⠊⠑⠋ ⠍⠳⠗⠝⠻⠲ ⡎⠊⠗⠕⠕⠛⠑ ⠎⠊⠛⠝⠫ ⠊⠞⠲ ⡁⠝⠙
⡎⠊⠗⠕⠕⠛⠑⠰⠎ ⠝⠁⠍⠑ ⠺⠁⠎ ⠛⠕⠕⠙ ⠥⠏⠕⠝ ⠰⡡⠁⠝⠛⠑⠂ ⠋⠕⠗ ⠁⠝⠹⠹⠔⠛ ⠙⠑
⠡⠕⠎⠑ ⠞⠕ ⠏⠥⠞ ⠙⠊⠎ ⠙⠁⠝⠙ ⠞⠕⠲
(The first couple of paragraphs of "A Christmas Carol" by Dickens)
Greetings in various languages:
Hello world, Καλημέρα κόσμε, コンニチハ
-------------------------------------------------------------------------------