Large scale tests - example 1

The data for these figures was generated in the following way. We generated 100 data points for each of the 1000 leaves in this data set. Most of these 100 points where generated at random, however (as can be seen) for each leaf we chose 30 points which where set to -1 (in different places in different leaves). We then computed the correlation coefficients between all the leaves and performed hierarchical clustering. Next we ordered the resulting tree using our optimal ordering algorithm.
Hierarchical clustering Optimal ordering