Skip to content

Instantly share code, notes, and snippets.

@nacho4d
Created June 5, 2012 11:10
Show Gist options
  • Star 1 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save nacho4d/2874422 to your computer and use it in GitHub Desktop.
Save nacho4d/2874422 to your computer and use it in GitHub Desktop.
Universal Charset Detector Results

Universal Charset Detector:

Possible Encodings:

  • "ibm850"

  • "ASCII"

  • "UTF-8"

  • "X-ISO-10646-UCS-4-3412"

  • "X-ISO-10646-UCS-4-2143"

  • "UTF-16BE"

  • "UTF-16LE"

  • "UTF-32BE"

  • "UTF-32LE"

Single Byte Charset:

Cyrilic

  • Koi8rModel: "KOI8-R"
  • Win1251Model: "windows-1251"
  • Latin5Model: "ISO-8859-5"
  • MacCyrillicModel: "x-mac-cyrillic"
  • Ibm866Model: "IBM866"
  • Ibm855Model: "IBM855"

Greek

  • Latin7Model: "ISO-8859-7"
  • Win1253Model: "windows-1253"

Bulgarian

  • Latin5BulgarianModel: "ISO-8859-5"
  • Win1251BulgarianModel: "windows-1251"

Hungarian

  • Latin2HungarianModel: "ISO-8859-2"
  • Win1250HungarianModel: "windows-1250"

Hebrew

  • Visual Hebrew: "ISO-8859-8"
  • Logical Hebrew: "windows-1255"

Multi Byte Charset

  • "Big5"
  • "EUC-JP"
  • "EUC-KR"
  • "x-euc-tw"
  • "gb18030" || "GB18030"
  • "Shift_JIS"
  • "UTF-8"

Latin1 Charset

  • "windows-1252"

Even More

  • "HZ-GB-2312"
  • "ISO-2022-CN"
  • "ISO-2022-JP"
  • "ISO-2022-KR"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment