Skip to content

Instantly share code, notes, and snippets.

@fnielsen
Created September 21, 2014 22:50
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save fnielsen/b19d8ed4b731c6838c85 to your computer and use it in GitHub Desktop.
Save fnielsen/b19d8ed4b731c6838c85 to your computer and use it in GitHub Desktop.
String for tokenization
s = u"""DTU course 02819 is taught by Mr. Finn Årup Nielsen,
Ph.D. Some of aspects of the course are: machine learning and web
2.0. The telephone to Finn is (+45) 4525 3921, and his email is
faan@dtu.dk. A book published by O'Reilly called 'Programming
Collective Intelligence' might be useful. It costs $39.99 or 285.00
kroner in Polyteknisk Boghandle. Is 'Text Processing in Python'
appropriate for the course? Perhaps! The constructor function in
Python is called "__init__()". fMRI will not be a topic of the
course."""
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment