@kylepjohnson
kylepjohnson / tlg_auth_sent_data_v3.txt
Created September 21, 2014 16:33
Words per sentence data for TLG authors.
{'Elegiaca Adespota (CA)': {'sent_count': 10, 'word_count': 187, 'avg_words_per_sent': 18.7, 'tally_of_sent_word_lengths': {2: 1, 5: 1, 39: 1, 8: 1, 12: 1, 16: 3, 19: 1, 54: 1}}, 'Apollodorus Carystius vel Apollodorus Gelous Comic.': {'sent_count': 57, 'word_count': 856, 'avg_words_per_sent': 15.017543859649123, 'tally_of_sent_word_lengths': {1: 1, 2: 1, 3: 2, 5: 3, 6: 6, 11: 5, 12: 3, 13: 7, 14: 3, 15: 4, 16: 2, 17: 1, 18: 2, 19: 1, 20: 5, 22: 2, 23: 1, 24: 1, 25: 1, 27: 1, 28: 2, 30: 1, 32: 1, 49: 1}}, 'Aristocrates Hist.': {'sent_count': 26, 'word_count': 234, 'avg_words_per_sent': 9.0, 'tally_of_sent_word_lengths': {1: 12, 2: 3, 3: 1, 10: 1, 43: 1, 12: 1, 45: 1, 14: 1, 13: 1, 19: 2, 26: 1, 15: 1}}, 'Echembrotus Eleg. et Lyr.': {'sent_count': 2, 'word_count': 19, 'avg_words_per_sent': 9.5, 'tally_of_sent_word_lengths': {17: 1, 2: 1}}, 'Anonymi In Aristotelis Sophisticos Elenchos Phil.': {'sent_count': 2350, 'word_count': 56930, 'avg_words_per_sent': 24.22553191489362, 'tally_of_sent_word_lengths': {1: 26,
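Each file body is a Python dict literal keyed by author, so it can be read back with `ast.literal_eval` (the companion script imports `ast`). The following is a minimal sketch, assuming the text has already been loaded into a string. The `excerpt` holds two complete records copied from the data above, and `check_record` is a hypothetical helper, not part of the gist; it confirms that the summary fields agree with the tally.

```python
import ast

# Two complete records copied from the data above.
excerpt = (
    "{'Elegiaca Adespota (CA)': {'sent_count': 10, 'word_count': 187, "
    "'avg_words_per_sent': 18.7, 'tally_of_sent_word_lengths': "
    "{2: 1, 5: 1, 39: 1, 8: 1, 12: 1, 16: 3, 19: 1, 54: 1}}, "
    "'Echembrotus Eleg. et Lyr.': {'sent_count': 2, 'word_count': 19, "
    "'avg_words_per_sent': 9.5, 'tally_of_sent_word_lengths': {17: 1, 2: 1}}}"
)


def check_record(record):
    """Return True if the summary fields agree with the tally (hypothetical helper)."""
    tally = record['tally_of_sent_word_lengths']
    sentences = sum(tally.values())
    words = sum(length * count for length, count in tally.items())
    return (sentences == record['sent_count']
            and words == record['word_count']
            and abs(words / sentences - record['avg_words_per_sent']) < 1e-9)


# literal_eval parses the dict literal without executing arbitrary code.
data = ast.literal_eval(excerpt)
for author, record in data.items():
    print(author, check_record(record))
```

Both records pass the check: the tally keys are sentence lengths in words, and the values are the number of sentences of that length.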
kylepjohnson / avg_words_per_sentence_per_tlg_author_v3.py
Last active August 29, 2015 14:06
For computing sentence length data for TLG authors.
"""For computing sentence length data for TLG authors."""
import ast
from cltk.tokenize.sentence_tokenizer_greek import tokenize_greek_sentences
from collections import Counter
from nltk.tokenize import RegexpTokenizer
import os
import re
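Only the imports of this script are shown, so the following is a sketch of how per-author statistics with the same keys as the data file could be computed, not the gist's actual code. It assumes the sentences have already been split (the script imports a CLTK sentence tokenizer for that step) and substitutes a plain `\w+` regular expression for the NLTK `RegexpTokenizer`; `sentence_stats` and `WORD_RE` are hypothetical names.

```python
import re
from collections import Counter

# Stand-in word tokenizer (assumption): the pattern used in the gist is not shown.
WORD_RE = re.compile(r'\w+')


def sentence_stats(sentences):
    """Summarize sentence lengths, in words, for one author's sentences."""
    lengths = [len(WORD_RE.findall(sentence)) for sentence in sentences]
    lengths = [n for n in lengths if n > 0]  # ignore sentences with no words
    sent_count = len(lengths)
    word_count = sum(lengths)
    return {
        'sent_count': sent_count,
        'word_count': word_count,
        'avg_words_per_sent': word_count / sent_count if sent_count else 0.0,
        # Maps sentence length -> number of sentences of that length.
        'tally_of_sent_word_lengths': dict(Counter(lengths)),
    }


stats = sentence_stats(['arma virumque cano', 'veni vidi vici', 'salve'])
print(stats)
```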
kylepjohnson / tlg_auth_word_sentence_v3.csv
Last active August 29, 2015 14:06
CSV export of words per sentence data for TLG authors.
kylepjohnson / phi5_auth_sent_data_v3.txt
Last active August 29, 2015 14:06
Words per sentence data for PHI5 authors.
{'Sentius Augurinus': {'sent_count': 4, 'word_count': 45, 'avg_words_per_sent': 11.25, 'tally_of_sent_word_lengths': {17: 1, 10: 1, 5: 1, 13: 1}}, 'Cornelius Epicadus': {'sent_count': 1, 'word_count': 8, 'avg_words_per_sent': 8.0, 'tally_of_sent_word_lengths': {8: 1}}, 'Marcus Aurelius': {'sent_count': 1, 'word_count': 5, 'avg_words_per_sent': 5.0, 'tally_of_sent_word_lengths': {5: 1}}, 'Publius Rutilius Lupus': {'sent_count': 432, 'word_count': 4388, 'avg_words_per_sent': 10.157407407407407, 'tally_of_sent_word_lengths': {1: 78, 2: 31, 3: 5, 4: 20, 5: 11, 6: 14, 7: 18, 8: 24, 9: 27, 10: 25, 11: 23, 12: 15, 13: 21, 14: 18, 15: 13, 16: 15, 17: 8, 18: 12, 19: 6, 20: 6, 21: 7, 22: 3, 23: 3, 24: 4, 25: 3, 26: 2, 27: 4, 28: 1, 29: 2, 30: 1, 31: 1, 32: 1, 33: 1, 34: 1, 35: 2, 37: 2, 40: 1, 52: 2, 79: 1}}, 'Priapea': {'sent_count': 248, 'word_count': 3519, 'avg_words_per_sent': 14.189516129032258, 'tally_of_sent_word_lengths': {1: 1, 2: 8, 3: 5, 4: 9, 5: 12, 6: 17, 7: 11, 8: 6, 9: 7, 10: 14, 11: 18, 12: 14, 13: 23,
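Because each tally maps a sentence length to a count, the full list of sentence lengths can be rebuilt from it, which allows statistics other than the stored mean. A small sketch using the 'Sentius Augurinus' record above; the median is an illustration and is not a field in the data file.

```python
from collections import Counter
from statistics import median

# Tally copied from the 'Sentius Augurinus' record above.
tally = {17: 1, 10: 1, 5: 1, 13: 1}

# Counter.elements() repeats each length as many times as its count.
lengths = sorted(Counter(tally).elements())
print(lengths)
print(median(lengths))
```

For this record the median (11.5) is close to the stored mean (11.25); the two can diverge for authors with a few very long sentences.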
kylepjohnson / avg_words_per_sentence_per_phi5_author_v3.py
Created September 21, 2014 19:11
For computing sentence length data for PHI5 authors.
"""For computing sentence length data for PHI5 authors."""
import ast
from cltk.tokenize.sentence_tokenizer_latin import tokenize_latin_sentences
from collections import Counter
from nltk.tokenize import RegexpTokenizer
import os
import re
kylepjohnson / phi5_auth_word_sentence_v3.csv
Last active August 29, 2015 14:06
CSV export of words per sentence data for PHI5 authors.
author,word_count,sent_count,5,6,1,avg_words_per_sent,2,3,4,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,25,27,28,29,31,33,40,44,24,26,30,32,34,35,36,37,38,39,41,42,45,47,48,49,51,54,55,57,59,65,80,82,107,43,50,52,53,58,62,67,68,76,88,46,106,136,63,64,193,69,70,73,84,86,87,91,92,100,90,109,119,56,60,61,66,71,72,74,75,77,78,79,81,83,85,89,93,95,96,98,97,105,108,110,112,113,122,125,126,147,153,155,178,104,114,121,148,204,94,102,115,133,146,144,145,99,101,111,117,118,120,123,129,130,132,103,364,142,149,151,159,160,161,163,167,179,189,200,116,128,124,199,227,248,134,184,127,196,158,352,139,157,162,164,192,217,226,252,135,137,138,141,143,150,154,581,168,284,174,186,287,191,198,203,205,271,209,280,255,212,214,237,279,236,140,302,166,197,211,222,241,247,183,176,131,781,172
Ablabius,12.0,3.0,1.0,1.0,1.0,4.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
Aem
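The header above interleaves the summary columns with the sentence-length columns in no particular order. How the export was produced is not shown, so the following is only a sketch of one way to flatten the per-author dicts into CSV with the summary columns first and the length columns sorted numerically. `records_to_csv` is a hypothetical name; the Ablabius values are taken from the row above, and the second record from the PHI5 data file.

```python
import csv
import io


def records_to_csv(data):
    """Flatten {author: record} into CSV text, one column per sentence length."""
    lengths = sorted({n for rec in data.values()
                      for n in rec['tally_of_sent_word_lengths']})
    fieldnames = (['author', 'word_count', 'sent_count', 'avg_words_per_sent']
                  + [str(n) for n in lengths])
    out = io.StringIO()
    # restval='' leaves a cell empty when an author has no sentence of that length.
    writer = csv.DictWriter(out, fieldnames=fieldnames, restval='',
                            lineterminator='\n')
    writer.writeheader()
    for author, rec in data.items():
        row = {
            'author': author,
            'word_count': rec['word_count'],
            'sent_count': rec['sent_count'],
            'avg_words_per_sent': rec['avg_words_per_sent'],
        }
        row.update({str(n): count
                    for n, count in rec['tally_of_sent_word_lengths'].items()})
        writer.writerow(row)
    return out.getvalue()


sample = {
    'Ablabius': {'sent_count': 3, 'word_count': 12, 'avg_words_per_sent': 4.0,
                 'tally_of_sent_word_lengths': {5: 1, 6: 1, 1: 1}},
    'Cornelius Epicadus': {'sent_count': 1, 'word_count': 8,
                           'avg_words_per_sent': 8.0,
                           'tally_of_sent_word_lengths': {8: 1}},
}
csv_text = records_to_csv(sample)
print(csv_text)
```

Every row has the same number of columns as the header, which avoids ragged rows in the output.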
kylepjohnson / gist:5a5aa14e2181709c9f93
Created April 24, 2015 13:49
Most frequent word forms ending in -que, -ne, and -ue, ranked by count
atque 20349
neque 14000
quoque 12658
itaque 5090
usque 2667
denique 2216
quisque 1675
namque 1658
quinque 1476
utique 1325
ne 17403
sine 8675
nomine 4131
bene 3753
ratione 2391
sane 2115
omne 1959
sanguine 1405
paene 1314
condicione 1266
siue 4851
praecipue 885
neue 503
ioue 422
graue 377
breue 217
leue 215
caue 209
salue 208
naue 188
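A ranking like the one above can be sketched with `collections.Counter`. The code that produced the list is not shown, so this is an assumption-laden illustration: `top_by_ending` is a hypothetical helper, the token list is a toy example, and excluding -que forms from the -ue group is inferred from the listing, where no -que word appears among the -ue entries. Matching on the ending alone does not separate true enclitics from words that merely end in those letters (for example `sine` or `bene`).

```python
from collections import Counter


def top_by_ending(tokens, ending, exclude=(), n=10):
    """Return the n most frequent tokens with the given ending (hypothetical helper)."""
    counts = Counter(
        token for token in tokens
        if token.endswith(ending)
        and not any(token.endswith(other) for other in exclude)
    )
    return counts.most_common(n)


# Toy token list; the real input would be a tokenized Latin corpus.
tokens = 'atque neque atque sine ne atque siue neue siue ne ne'.split()

top_que = top_by_ending(tokens, 'que')
top_ne = top_by_ending(tokens, 'ne')
top_ue = top_by_ending(tokens, 'ue', exclude=('que',))  # keep -que out of -ue
for label, ranked in (('que', top_que), ('ne', top_ne), ('ue', top_ue)):
    for word, count in ranked:
        print(label, word, count)
```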
vagrant@vagrant-ubuntu-trusty-64:/vagrant/morpheus/src$ make
cd greeklib; make greeklib.a
make[1]: Entering directory `/vagrant/morpheus/src/greeklib'
gcc -O2 -I../includes -c -o Fclose.o Fclose.c
Fclose.c: In function ‘xFree’:
Fclose.c:28:2: warning: incompatible implicit declaration of built-in function ‘free’ [enabled by default]
free(p);
^
gcc -O2 -I../includes -c -o addaccent.o addaccent.c
gcc -O2 -I../includes -c -o addbreath.o addbreath.c