Wordcount metrics:
find directorywithlogfiles/ -type f -exec cat {} + | tr -d '\r' | sed -E 's/[ "!?.,()]+/\n/g' | sed '/^$/d' | sort -f | uniq -ic | sort -rn > all.freq
This writes a case-insensitive word-frequency list to all.freq, most frequent words first. Names are left intact, so you will need to delete them from the list to ensure anonymity.
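One way to delete the names is to keep them in a separate file and filter the frequency list against it. The sketch below is only an illustration: the function name strip_names and the file names.txt (one name per line) are assumptions, not part of the original workflow. It expects the `uniq -c` layout produced above, with the count in the first field and the word in the second.

```shell
# strip_names NAMES_FILE FREQ_FILE
# Hypothetical helper: prints FREQ_FILE without the lines whose word
# (second field) appears in NAMES_FILE. Matching ignores case.
strip_names() {
  awk '
    # First file: remember every name, lowercased.
    FILENAME == ARGV[1] { names[tolower($1)]; next }
    # Second file: print only the lines whose word is not a known name.
    !(tolower($2) in names)
  ' "$1" "$2"
}
```

Usage would look like `strip_names names.txt all.freq > all.anon.freq`. This only catches names that appear as whole words and are listed in the names file, so the result still needs a manual check.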
Other interesting metrics:
- Breakdown of pairings (M/M, M/F, etc.) as percentages
- Interesting interactions, kinks, fetishes, interests
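The pairing breakdown could be computed along the following lines. This is a rough sketch that assumes the pairings have already been extracted into a file with one label per line (for example M/M or M/F); that file layout and the function name pairing_percentages are hypothetical, since the logs themselves do not necessarily record pairings in this form.

```shell
# pairing_percentages FILE
# Hypothetical helper: FILE holds one pairing label per line.
# Prints each label with its share of all non-empty lines, largest first.
pairing_percentages() {
  awk '
    NF { count[$1]++; total++ }   # skip blank lines, tally each label
    END {
      for (p in count)
        printf "%s %.1f%%\n", p, 100 * count[p] / total
    }
  ' "$1" | sort -k2,2 -rn
}
```

Usage would look like `pairing_percentages pairings.txt`, where pairings.txt is the assumed input file.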