Soni L. SoniEx2

Unicode Instruction Set Architecture (UISA)

Unicode (usually encoded as UTF-8) is the leading character set in use on computers today. However, a few factors hinder its adoption:

  • It's big. It's actually huge! You need to include ever-larger character tables in your programs if you want to deal with Unicode.
  • You can't really do anything without those character tables. Unicode is a complete mess without them!
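
To make the dependence on character tables concrete, here is a minimal sketch in Python, whose standard `unicodedata` module ships a copy of the Unicode Character Database. The helper name `describe` is made up for this illustration and is not part of UISA. Every answer it returns comes from a table lookup; nothing in the code point value itself says that a character is a lowercase letter or what its uppercase form is.

```python
import unicodedata


def describe(ch):
    """Return (name, general category, uppercase form) for one character.

    `describe` is a hypothetical helper for illustration only. All three
    values are looked up in the character tables bundled with Python.
    """
    return unicodedata.name(ch), unicodedata.category(ch), ch.upper()


if __name__ == "__main__":
    for ch in ("\u00e9", "\u00df"):  # e with acute, sharp s
        print(describe(ch))
```

Note that uppercasing the sharp s yields the two-character string "SS", a mapping that cannot be computed arithmetically and has to be stored in a table.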

So we shall define an ISA that replaces Unicode. Meet UISA!