Skip to content

Instantly share code, notes, and snippets.

@lmeulen
Last active June 29, 2023 05:28
Show Gist options
  • Save lmeulen/a3807c73abb45b65e9d9ba1d479a2d79 to your computer and use it in GitHub Desktop.
Save lmeulen/a3807c73abb45b65e9d9ba1d479a2d79 to your computer and use it in GitHub Desktop.
freetextprivacy_dates
def remove_dates(text):
text = re.sub("\d{2}[- /.]\d{2}[- /.]\d{,4}", "<DATUM> ", text)
text = re.sub(
"(\d{1,2}[^\w]{,2}(januari|februari|maart|april|mei|juni|juli|augustus"\
"|september|oktober|november|december)([- /.]{,2}(\d{4}|\d{2})){,1})"\
"(?P<n>\D)(?![^<]*>)", "<DATE> ", text)
text = re.sub(
"(\d{1,2}[^\w]{,2}(jan|feb|mrt|apr|mei|jun|jul|aug|sep|okt|nov|dec)"\
"([- /.]{,2}(\d{4}|\d{2})){,1})(?P<n>\D)(?![^<]*>)", "<DATE> ", text)
return text
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment