@lucemia
Created October 17, 2013 14:30
A simple way to extract Chinese and English characters
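import re
# The ur'' prefix is Python 2 only; in Python 3 a raw string works because re interprets \uXXXX escapes itself.
# Matches runs of CJK Unified Ideographs (U+4E00-U+9FFF) or word characters (letters, digits, underscore).
re_pure_text = re.compile(r'[\u4e00-\u9fff\w]+', re.UNICODE)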
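A minimal usage sketch, assuming Python 3. The helper names `extract_text` and `extract_strict` and the sample string are hypothetical and only here for illustration. Note that `\w` also matches digits and underscores, and in Unicode mode it matches letters from other scripts as well, so the second pattern shows one possible stricter variant limited to CJK ideographs and ASCII letters. That variant is an assumption about what "Chinese + English" might mean, not part of the original gist.

```python
import re

# Same pattern as the gist, written with Python 3 string syntax.
RE_PURE_TEXT = re.compile(r'[\u4e00-\u9fff\w]+')

# Hypothetical stricter variant: only CJK Unified Ideographs and ASCII letters.
RE_STRICT = re.compile(r'[\u4e00-\u9fffA-Za-z]+')


def extract_text(s):
    """Return all runs of word characters or CJK ideographs in s."""
    return RE_PURE_TEXT.findall(s)


def extract_strict(s):
    """Return all runs of CJK ideographs or ASCII letters in s."""
    return RE_STRICT.findall(s)


sample = "Hello, 世界! price=42 (测试)"
print(extract_text(sample))    # ['Hello', '世界', 'price', '42', '测试']
print(extract_strict(sample))  # ['Hello', '世界', 'price', '测试']
```

Punctuation and whitespace act as separators in both cases; the strict variant additionally drops the digit run `42`.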