Skip to content

Instantly share code, notes, and snippets.

@teisman
Last active December 11, 2015 19:48
Show Gist options
  • Save teisman/4650436 to your computer and use it in GitHub Desktop.
Save teisman/4650436 to your computer and use it in GitHub Desktop.

##Common identifiers

  • M = # words
  • V = # unique words
  • C = # characters
  1. Number of blank lines/total number of lines
  2. Average sentence length
  3. Average word length
  4. Vocabulary richness V/M
  5. Total number of function words/M
  6. Function word frequancy distribution
  7. Total number of shortwords/M
  8. Count of hapax legomena/M
  9. Count of hapax legomena/V
  10. Total number of characters in words/C
  11. Total number of alphabetic characters in words/C
  12. Total number of upper-case characters in words/C
  13. Total number of digit characters in words/C
  14. Total number of white-space characters in words/C
  15. Total number of space characters/C
  16. Total number of space chaarcters/number white-space characters
  17. Total number of tab spaces/C
  18. Total number of tab spaces/number white-space characters
  19. Total number of punctuations/C
  20. World length frequence distribution/M

##Rules ###Capital letters Capital letters after .

###Punctuation No comma before and

###Whitespace Spaces after and no spaces before , . : Spaces before and no spaces after opening " ' Spaces after and no spaces before closing " '

###Use of words

###Use of digits

###Vocabulary richness metrics

###Function words the, if, to, ...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment