|Script originally sourced from Peter Baumgartner|
|and then modified by Lynn Cherny to allow a corpus file,|
|any gensim w2v model file, and make or read a counts file before the|
|The counts are used to focus on the most common words, and more|
|frequent words show as lighter colors in the UMAP display Peter made.|
|NOTE: Pip install umap-learn not umap; the import method below fixes a bad install/umap issue.|
We can't make this file beautiful and searchable because it's too large.
|Unique Key,Created Date,Closed Date,Location Type,Incident Zip,Incident Address,Street Name,Cross Street 1,Cross Street 2,Address Type,City,Due Date,Resolution Action Updated Date,Community Board,Borough,Park Borough,Latitude,Longitude,Location 32962543,3/22/16 23:53,,3+ Family Apt. Building,11225,335 LEFFERTS AVENUE,LEFFERTS AVENUE,NOSTRAND AVENUE,NEW YORK AVENUE,ADDRESS,BROOKLYN,4/21/16 23:53,3/23/16 0:00,09 BROOKLYN,BROOKLYN,BROOKLYN,40.66239321,-73.95026623,"(40.66239320568563, -73.95026622688094)" 32966604,3/22/16 23:50,,Other (Explain Below),11233,1323 HERKIMER STREET,HERKIMER STREET,PLEASANT PLACE,MONACO PLACE,ADDRESS,BROOKLYN,4/21/16 23:50,3/22/16 23:53,16 BROOKLYN,BROOKLYN,BROOKLYN,40.67746197,-73.90959516,"(40.67746196594494, -73.90959515599286)" 32962660,3/22/16 23:22,,Other (Explain Below),10025,120 WEST 109 STREET,WEST 109 STREET,COLUMBUS AVENUE,AMSTERDAM AVENUE,ADDRESS,NEW YORK,4/21/16 23:22,3/22/16 23:30,07 MANHATTAN,MANHATTAN,MANHATTAN,40.80151565,-73.96229483,"(40.801515645553465, -73.9622948|
An illustration of TSNE layout of word2vec output from a subset of Yelp reviews.
Use the mouse to click on a dot and see the word plotted. Click a label to hide it again.
Color indicates polarity based on simple word labeling from the AFINN wordlist. It may be that context in this dataset affects polarity :)
|<meta http-equiv="Content-Type" content="text/html; charset=utf-8">|
|<title>Highcharts Example SlopeGraph</title>|