# Convert ISO-8859-1 bytes to Unicode, then to UTF-8; `v` holds the ISO-8859-1 encoded byte string
v.decode("iso-8859-1").encode("utf-8")
As a side note, here is a basic rule of thumb:
If you have no way of finding out the correct encoding of the file, then try the following encodings, in this order:
utf-8
iso-8859-1 (also known as latin-1)
(This is the encoding of all census data and much other data produced by government entities.)
utf-16
```
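The fallback order above could be sketched roughly like this in Python 3. The helper name `decode_with_fallback` and its `encodings` parameter are made up for illustration, and `raw` is assumed to be a `bytes` object read from the file:

```python
def decode_with_fallback(raw, encodings=("utf-8", "iso-8859-1", "utf-16")):
    """Try each candidate encoding in order; return (text, encoding_used)."""
    for name in encodings:
        try:
            return raw.decode(name), name
        except UnicodeDecodeError:
            # This candidate cannot represent the bytes; try the next one.
            continue
    raise ValueError("none of the candidate encodings could decode the input")


# Hypothetical sample: bytes that are valid ISO-8859-1 but not valid UTF-8.
text, used = decode_with_fallback("café".encode("iso-8859-1"))
print(used, text)
```

One caveat with this sketch: Python's `iso-8859-1` codec accepts every possible byte value, so the loop never actually reaches `utf-16`. A successful decode only means the bytes were decodable, not that the guess was right, so it is still worth eyeballing the result.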
Hi, I have a tweet dataset and I am wondering how to convert the encoding. Here is an example of the text: 'People saying that he should be removed from : 1) Thatâ<U+0080><U+0099>s another movie. Two wrongs donâ<U+0080><U+0099>t make a right'. I have tried encode().decode(UTF-8) and nothing