Skip to content

Instantly share code, notes, and snippets.

@alanorth
Created August 20, 2018 07:40
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save alanorth/cb80a5d4f860f9cbbdc6c1f8228b93e2 to your computer and use it in GitHub Desktop.
Save alanorth/cb80a5d4f860f9cbbdc6c1f8228b93e2 to your computer and use it in GitHub Desktop.
Open Refine custom text facet to detect weird Unicode characters that usually indicate some encoding error. This allows you to find the characters and fix them manually.
or(
isNotNull(value.match(/.*\uFFFD.*/)),
isNotNull(value.match(/.*\u00A0.*/)),
isNotNull(value.match(/.*\u200A.*/)),
isNotNull(value.match(/.*\u2019.*/)),
isNotNull(value.match(/.*\u00b4.*/))
)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment