@brh55
Last active May 13, 2016 14:33
Coefficients for Document Retrieval

What for?

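Each coefficient corrects the number of overlapping terms for differences in document size, so that a large document does not score higher just because it contains more terms. Here x and y stand for DOC1 and DOC2, |x| and |y| are their sizes (document length or vocabulary size), and | x and y | is the number of terms they share.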

Cosine Coefficient

| x and y | / (|x|^(1/2) × |y|^(1/2))
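
As a minimal sketch, the cosine coefficient above can be computed on Python sets. It assumes each document is reduced to its set of distinct whitespace-separated terms (the binary, set-based form of cosine, not the weighted-vector form). The function name `cosine_coefficient`, the sample sentences, and the choice to return 0.0 for an empty document are illustrative assumptions, not part of the original notes.

```python
import math


def cosine_coefficient(x: set, y: set) -> float:
    """Set-based cosine: |x and y| / (sqrt(|x|) * sqrt(|y|))."""
    # Assumption: an empty document has no similarity to anything.
    if not x or not y:
        return 0.0
    return len(x & y) / math.sqrt(len(x) * len(y))


# Hypothetical sample documents, tokenized by whitespace.
doc1 = set("the cat sat on the mat".split())  # 5 distinct terms
doc2 = set("the cat ate the rat".split())     # 4 distinct terms

# 2 shared terms ("the", "cat"), so the result is 2 / sqrt(5 * 4).
print(cosine_coefficient(doc1, doc2))
```

Dividing by the geometric mean of the two sizes keeps the score between 0 and 1, and it equals 1 only when both term sets are identical.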

Jaccard Coefficient

| x and y | / | x or y |
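
A short sketch of the Jaccard coefficient, read as shared terms divided by all distinct terms across both documents. It assumes documents are sets of whitespace-separated terms. The name `jaccard_coefficient`, the sample sentences, and the convention of returning 0.0 when both sets are empty are assumptions made for illustration.

```python
def jaccard_coefficient(x: set, y: set) -> float:
    """Jaccard: size of the intersection divided by size of the union."""
    union = x | y
    # Assumption: two empty documents are scored 0.0 to avoid dividing by zero.
    if not union:
        return 0.0
    return len(x & y) / len(union)


# Hypothetical sample documents, tokenized by whitespace.
doc1 = set("the cat sat on the mat".split())  # 5 distinct terms
doc2 = set("the cat ate the rat".split())     # 4 distinct terms

# 2 shared terms out of 7 distinct terms overall, so the result is 2 / 7.
print(jaccard_coefficient(doc1, doc2))
```

Because the union grows with every term that appears in only one document, Jaccard penalizes non-shared terms on both sides.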

Overlap Coefficient

| x and y | / min(|x|, |y|)
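
The overlap coefficient can be sketched the same way, again assuming documents are sets of whitespace-separated terms. The name `overlap_coefficient`, the sample sentences, and the 0.0 result for an empty document are illustrative assumptions.

```python
def overlap_coefficient(x: set, y: set) -> float:
    """Overlap: size of the intersection divided by the smaller set's size."""
    # Assumption: an empty document has no overlap with anything.
    if not x or not y:
        return 0.0
    return len(x & y) / min(len(x), len(y))


# Hypothetical sample documents, tokenized by whitespace.
doc1 = set("the cat sat on the mat".split())  # 5 distinct terms
doc2 = set("the cat ate the rat".split())     # 4 distinct terms

# 2 shared terms, smaller document has 4 terms, so the result is 2 / 4.
print(overlap_coefficient(doc1, doc2))
```

Since only the smaller document's size is used, the score reaches 1 whenever one document's terms are fully contained in the other, however large the other is.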
