There are four forms of UTF-16:
- UTF-16BE
- UTF-16LE
- UTF-16LBE
- UTF-16LLE
The first two are defined by the Unicode Consortium. The latter two are defined in this document.
The easiest way to explain UTF-16LLE is to describe how it differs from UTF-16LE.
For documents that only use code points in the Basic Multilingual Plane (BMP), UTF-16LE and UTF-16LLE are byte-for-byte identical.
For documents that use code points in the supplementary planes, the only difference is that the order of the two code units in each surrogate pair is swapped: the low surrogate (0xDC00 to 0xDFFF) comes first, and the high surrogate (0xD800 to 0xDBFF) comes second. The byte order within each 16-bit code unit is unchanged.
UTF-16LBE relates to UTF-16BE in the same way that UTF-16LLE relates to UTF-16LE: the only change is the order in which the two surrogates of each pair appear.
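The rules above can be sketched in Python as follows. This is a minimal illustration, not a reference implementation: the function names `encode_utf16_swapped` and `decode_utf16_swapped` and the `byteorder` parameter are made up for this example, and the handling of unpaired surrogates and odd-length input (passed through and rejected, respectively) is an assumption, since this document does not specify it.

```python
import struct


def encode_utf16_swapped(text: str, byteorder: str = "little") -> bytes:
    """Encode text as UTF-16LLE ("little") or UTF-16LBE ("big").

    Hypothetical helper: identical to UTF-16LE/BE except that each
    surrogate pair is written low surrogate first, high surrogate second.
    """
    fmt = "<H" if byteorder == "little" else ">H"
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp < 0x10000:
            # BMP code point: one code unit, same as plain UTF-16.
            out += struct.pack(fmt, cp)
        else:
            # Supplementary code point: compute the usual surrogate pair...
            v = cp - 0x10000
            high = 0xD800 | (v >> 10)
            low = 0xDC00 | (v & 0x3FF)
            # ...then emit it in swapped order (low first, high second).
            out += struct.pack(fmt, low)
            out += struct.pack(fmt, high)
    return bytes(out)


def decode_utf16_swapped(data: bytes, byteorder: str = "little") -> str:
    """Decode UTF-16LLE ("little") or UTF-16LBE ("big") bytes to text."""
    if len(data) % 2:
        raise ValueError("input length must be a multiple of 2 bytes")
    prefix = "<" if byteorder == "little" else ">"
    units = struct.unpack(f"{prefix}{len(data) // 2}H", data)
    chars = []
    i = 0
    while i < len(units):
        u = units[i]
        nxt = units[i + 1] if i + 1 < len(units) else None
        if 0xDC00 <= u <= 0xDFFF and nxt is not None and 0xD800 <= nxt <= 0xDBFF:
            # Low surrogate followed by high surrogate: a swapped pair.
            chars.append(chr(0x10000 + ((nxt - 0xD800) << 10) + (u - 0xDC00)))
            i += 2
        else:
            # BMP code unit (or an unpaired surrogate, passed through as-is).
            chars.append(chr(u))
            i += 1
    return "".join(chars)
```

For example, U+1F600 has the surrogate pair D83D DE00. In UTF-16LE it is the bytes 3D D8 00 DE; under the sketch above, UTF-16LLE gives 00 DE 3D D8 and UTF-16LBE gives DE 00 D8 3D.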
Copyright (C) 2017 Soni L.
All rights reserved.
This document may be freely shared, copied, and distributed.