What for? Correcting for differences in size: the number of overlapping terms is normalized by the sizes of the two sets being compared, where x and y are the term sets (document or vocabulary) of DOC1 and DOC2, and |x| is the number of terms in x.

Cosine Coefficient: |x and y| / (|x|^(1/2) × |y|^(1/2))
Jaccard Coefficient: |x and y| / |x or y|
Overlap Coefficient: |x and y| / min(|x|, |y|)
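
As a rough illustration, the three coefficients can be sketched in Python, assuming x and y are treated as sets of distinct terms (so "x and y" is the intersection and "x or y" is the union). The function names, the whitespace tokenization, and the two sample documents are made up for this example, and returning 0.0 for empty sets is a convention chosen here, not something the definitions specify.

```python
import math


def cosine_coefficient(x, y):
    """|x and y| / (|x|^(1/2) * |y|^(1/2)) for two term sets."""
    x, y = set(x), set(y)
    if not x or not y:
        return 0.0  # assumed convention for empty input
    return len(x & y) / math.sqrt(len(x) * len(y))


def jaccard_coefficient(x, y):
    """|x and y| / |x or y| for two term sets."""
    x, y = set(x), set(y)
    union = x | y
    if not union:
        return 0.0  # assumed convention for empty input
    return len(x & y) / len(union)


def overlap_coefficient(x, y):
    """|x and y| / min(|x|, |y|) for two term sets."""
    x, y = set(x), set(y)
    if not x or not y:
        return 0.0  # assumed convention for empty input
    return len(x & y) / min(len(x), len(y))


# Hypothetical documents, split on whitespace into terms.
doc1 = "the cat sat on the mat".split()  # 5 distinct terms
doc2 = "the cat ate".split()             # 3 distinct terms

print(round(cosine_coefficient(doc1, doc2), 3))
print(round(jaccard_coefficient(doc1, doc2), 3))
print(round(overlap_coefficient(doc1, doc2), 3))
```

Here the two documents share 2 terms out of 5 and 3 distinct terms. The raw overlap count is the same in all three cases; only the denominator changes, which is what corrects for the difference in size.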