Skip to content

Instantly share code, notes, and snippets.

@aparrish
Last active April 7, 2021 14:37
Show Gist options
  • Save aparrish/37f4661c2ece7fbda1916a84a85b28c9 to your computer and use it in GitHub Desktop.
Save aparrish/37f4661c2ece7fbda1916a84a85b28c9 to your computer and use it in GitHub Desktop.
aparrish workshop for machine literacy at sfpc

Introduction to Natural Language Processing and Word Vectors

Nothing is more essentially human than linguistic communication. But when programmers, data scientists, and computational linguists work with language, the abstractions they work with sometimes don't line up with your intuitive understanding of spelling and grammar. In this workshop, we'll investigate the state of the art of natural language processing, including: a whirlwind tour of spaCy for parsing English into syntactic constituents; a discussion of techniques for classifying and summarizing documents; and an explanation and demonstration of "word vectors" (like Google's word2vec), an innovative language technology that allows computers to process written language less as discrete units and more like a continuous signal. Workshop participants will develop a number of small projects in text analysis and poetics using a public domain text of their choice. In becoming familiar with contemporary techniques for computational language analysis, critics and researchers will be able to reason better about language-based media on the Internet. Artists and writers, meanwhile, might just learn a few new techniques to add to their creative palette.

Notes

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment