Skip to content

Instantly share code, notes, and snippets.

@JelmerDeHen
Created February 14, 2019 01:34
Show Gist options
  • Save JelmerDeHen/eef170676d718e9305c0c80d1d4cbccd to your computer and use it in GitHub Desktop.
Save JelmerDeHen/eef170676d718e9305c0c80d1d4cbccd to your computer and use it in GitHub Desktop.
List of interesting unicode code points, extracted from the uni2ascii man page (https://www.billposer.org/Software/uni2ascii_man.html)
U+00A2 CENT SIGN -> cent
U+00A3 POUND SIGN -> pound
U+00A5 YEN SIGN -> yen
U+00A9 COPYRIGHT SYMBOL -> (c)
U+00AE REGISTERED SYMBOL -> (R)
U+00BC ONE QUARTER -> 1/4
U+00BD ONE HALF -> 1/2
U+00BE THREE QUARTERS -> 3/4
U+00C6 CAPITAL LETTER ASH -> AE
U+00DF SMALL LETTER SHARP S -> ss
U+00E6 SMALL LETTER ASH -> ae
U+0132 LIGATURE IJ -> IJ
U+0133 LIGATURE ij -> ij
U+0152 LIGATURE OE -> OE
U+0153 LIGATURE oe -> oe
U+01F1 CAPITAL LETTER DZ -> DZ
U+01F2 MIXED LETTER Dz -> Dz
U+01F3 SMALL LETTER DZ -> dz
U+02A6 SMALL LETTER TS DIGRAPH -> ts
U+2026 HORIZONTAL ELLIPSIS -> ...
U+20AC EURO SIGN -> euro
U+22EF MIDLINE HORIZONTAL ELLIPSIS -> ...
U+2190 LEFTWARDS ARROW -> <-
U+2192 RIGHTWARDS ARROW -> ->
U+21D0 LEFTWARDS DOUBLE ARROW -> <=
U+21D2 RIGHTWARDS DOUBLE ARROW -> =>
U+FB00 LATIN SMALL LIGATURE FF -> ff
U+FB01 LATIN SMALL LIGATURE FI -> fi
U+FB02 LATIN SMALL LIGATURE FL -> fl
U+FB03 LATIN SMALL LIGATURE FFI -> ffi
U+FB04 LATIN SMALL LIGATURE FFL -> ffl
U+FB06 LATIN SMALL LIGATURE ST -> st
U+00A2 CENT SIGN -> c
U+00A3 POUND SIGN -> #
U+00A5 YEN SIGN -> Y
U+00A9 COPYRIGHT SYMBOL -> C
U+00AE REGISTERED SYMBOL -> R
U+00BC ONE QUARTER -> -
U+00BD ONE HALF -> -
U+00BE THREE QUARTERS -> -
U+00C6 CAPITAL LETTER ASH -> A
U+00DF SMALL LETTER SHARP S -> s
U+00E6 SMALL LETTER ASH -> a
U+0132 LIGATURE IJ -> I
U+0133 LIGATURE ij -> i
U+0152 LIGATURE OE -> O
U+0153 LIGATURE oe -> o
U+01F1 CAPITAL LETTER DZ -> D
U+01F2 MIXED LETTER Dz -> D
U+01F3 SMALL LETTER DZ -> d
U+02A6 SMALL LETTER TS DIGRAPH -> t
U+2026 HORIZONTAL ELLIPSIS -> .
U+20AC EURO SIGN -> E
U+22EF MIDLINE HORIZONTAL ELLIPSIS -> .
U+2190 LEFTWARDS ARROW -> <
U+2192 RIGHTWARDS ARROW -> >
U+21D0 LEFTWARDS DOUBLE ARROW -> <
U+21D2 RIGHTWARDS DOUBLE ARROW -> >
U+0085 next line 0x0A newline
U+00A0 no break space 0x20 space
U+00AB left-pointing double angle quotation mark 0x22 double quote
U+00AD soft hyphen 0x2D minus
U+00AF macron 0x2D minus
U+00B7 middle dot 0x2E period
U+00BB right-pointing double angle quotation mark 0x22 double quote
U+1361 ethiopic word space 0x20 space
U+1680 ogham space 0x20 space
U+2000 en quad 0x20 space
U+2001 em quad 0x20 space
U+2002 en space 0x20 space
U+2003 em space 0x20 space
U+2004 three-per-em space 0x20 space
U+2005 four-per-em space 0x20 space
U+2006 six-per-em space 0x20 space
U+2007 figure space 0x20 space
U+2008 punctuation space 0x20 space
U+2009 thin space 0x20 space
U+200A hair space 0x20 space
U+200B zero-width space 0x20 space
U+2010 hyphen 0x2D minus
U+2011 non-breaking hyphen 0x2D minus
U+2012 figure dash 0x2D minus
U+2013 en dash 0x2D minus
U+2014 em dash 0x2D minus
U+2018 left single quotation mark 0x60 left single quote
U+2019 right single quotation mark 0x27 right or neutral single quote
U+201A single low-9 quotation mark 0x60 left single quote
U+201B single high-reversed-9 quotation mark 0x60 left single quote
U+201C left double quotation mark 0x22 double quote
U+201D right double quotation mark 0x22 double quote
U+201E double low-9 quotation mark 0x22 double quote
U+201F double high-reversed-9 quotation mark 0x22 double quote
U+2022 bullet 0x6F small letter o
U+2028 line separator 0x0A newline
U+2033 double prime 0x22 double quote
U+2039 single left-pointing angle quotation mark 0x60 left single quote
U+203A single right-pointing angle quotation mark 0x27 right or neutral single quote
U+204E low asterisk 0x2A asterisk
U+2212 minus sign 0x2D minus
U+2216 set minus 0x5C backslash
U+2217 asterisk operator 0x2A asterisk
U+2223 divides 0x7C vertical line
U+2500 box drawing light horizontal 0x2D minus
U+2501 box drawing heavy horizontal 0x2D minus
U+2502 box drawing light vertical 0x7C vertical line
U+2503 box drawing heavy vertical 0x7C vertical line
U+2731 heavy asterisk 0x2A asterisk
U+275D heavy double turned comma quotation mark 0x22 double quote
U+275E heavy double comma quotation mark 0x22 double quote
U+3000 ideographic space 0x20 space
U+FE60 small ampersand 0x26 ampersand
U+FE61 small asterisk 0x2A asterisk
U+FE62 small plus sign 0x2B plus sign
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment