Data sources:
Image above from http://www.sci.muni.cz/~paja/, Dr Pavel Hyrsl's home page.
Sequences are a mix of those from Grewal et al's Dauer library, published and available on GenBank's dbEST database (~1600 sequences) and a new, unpublished set of ESTs from Ann Burnell and colleagues, Maynooth, Ireland (~700 sequences).
Clustering information:
SUMMARY OF CLUSTERING FOR HBC
=======================================================
Number of sequences = 2263
Total number of clusters = 1998
Number of clusters with 1 member = 1822
Number of clusters with > 1 member
derived from 441 sequences = 176
=======================================================