michaelguia/hierarchical.md

## hierarchical.md

      
    Raw
  

              hierarchical.md
            
          
    SUMMARY: Hierarchical cluster analysis


Hierarchical cluster analysis of n objects is defined by a stepwise algorithm which
merges two objects at each step, the two which have the least dissimilarity.


Dissimilarities between clusters of objects can be defined in several ways; for
example, the maximum dissimilarity (complete linkage), minimum dissimilarity
(single linkage) or average dissimilarity (average linkage).


Either rows or columns of a matrix can be clustered – in each case we choose the
appropriate dissimilarity measure that we prefer.


The results of a cluster analysis is a binary tree, or dendrogram, with n–1 nodes. The
branches of this tree are cut at a level where there is a lot of 'space' to cut them, that is
where the jump in levels of two consecutive nodes is large.


A permutation test is possible to validate the chosen number of clusters, that is to see
if there really is a non-random tendency for the objects to group together.