Skip to content

Instantly share code, notes, and snippets.

@kleem
kleem / README.md
Last active February 25, 2016 13:53
Cassandra word cloud IV (with search keyword)
@kleem
kleem / README.md
Last active March 10, 2016 15:55
Term frequencies II (normalized)
@kleem
kleem / README.md
Created December 23, 2015 17:22
Line chart II (normalized)
@kleem
kleem / README.md
Created December 21, 2015 13:44
Line Chart
@kleem
kleem / README.md
Created December 16, 2015 13:10
Horizon "Bar" Chart
@kleem
kleem / README.md
Last active December 5, 2015 10:51
Distribution of posts per user in forum (NVD3)
We couldn’t find that file to show.
@kleem
kleem / README.md
Last active December 4, 2015 19:33
langid.py

An example of usage of the langid.py language identification system.

Language codes returned by langid.py are converted into iso639-3. Hover on the results to see the confidence score as well as the language readable name.

This is just a test; the response is very delayed, since every time a query is issued the script loads the language models. The delay is avoidable by running langid as a service.

@kleem
kleem / README.md
Last active November 12, 2015 14:29
Distributions of posts per thread in Bluelight (Cassandra)

Log-log plot of the number of posts per thread in Bluelight forum. As shown, the log-log plot reveals a power-law distribution.

@kleem
kleem / README.md
Created November 3, 2015 12:41
Cassandra tf-idf
@kleem
kleem / README.md
Created October 24, 2015 19:00
Bluelight Keyword Over Time