Skip to content

Instantly share code, notes, and snippets.

@markmysourcecode
markmysourcecode / normalize_string.py
Created May 30, 2023 08:11 — forked from tmacam/normalize_string.py
string normalization in python: HTML stripping and HTML entity resolution.
#!/usr/bin/python
# vim:ts=4:sts=4:sw=4:et:wrap:ai:fileencoding=utf-8:
"""A collection of string normalization routines.
You are probably looking for normalize_string, that does an aggressive (but
arguably sound) string normalization process.
"""