Create a gist now

Instantly share code, notes, and snippets.

unescape html to python string
import urllib2
import sys
def cleanupString(string):
string = urllib2.unquote(string).decode('utf8')
return HTMLParser.HTMLParser().unescape(string).encode(sys.getfilesystemencoding())
@alexweber

Thanks, this was really useful! :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment