The left plot shows the distribution of protein size (in residue count) in the representative domain dataset.
The representative dataset used for the topology study is a non-redundant set of domains that share less than 40% sequence identity. The domain dataset is an exclusive combination of representative proteins from both CATH (v4.1) and SCOP (v1.75).
The domain dataset can be browsed in: Domain Search