Skip to content

Instantly share code, notes, and snippets.

@mazieres
Last active September 2, 2016 12:51
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save mazieres/26216f58579d6288e0dd0dc48ae4332a to your computer and use it in GitHub Desktop.
Save mazieres/26216f58579d6288e0dd0dc48ae4332a to your computer and use it in GitHub Desktop.
# dict of conversion of google's charset to python codec name
enc_google_to_python = {
"windows-1257": "cp1257",
"GB2312 ": "gb2312",
"windows-874": "cp874",
"EUC-KR": "euc_kr",
"Shift_JIS": "shift_jis",
"UTF-8": "utf_8",
"ISO-8859-2": "iso8859_2",
"ISO-8859-9": "iso8859_9",
"Big5": "big5",
"windows-1251": "cp1251",
"ISO-8859-1": "latin_1",
"windows-1256": "cp1256",
"ISO-8859-7": "iso8859_7",
"windows-1255": "cp1255"
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment