Details
Zusammenfassung: <jats:title>Abstract</jats:title><jats:p>Motivation: Proteins of the same class often share a secondary structure packing arrangement but differ in how the secondary structure units are ordered in the sequence. We find that proteins that share a common core also share local sequence–structure similarities, and these can be exploited to align structures with different topologies. In this study, segments from a library of local sequence–structure alignments were assembled hierarchically, enforcing the compactness and conserved inter-residue contacts but not sequential ordering. Previous structure-based alignment methods often ignore sequence similarity, local structural equivalence and compactness.</jats:p><jats:p>Results: The new program, SCALI (Structural Core ALIgnment), can efficiently find conserved packing arrangements, even if they are non-sequentially ordered in space. SCALI alignments conserve remote sequence similarity and contain fewer alignment errors. Clustering of our pairwise non-sequential alignments shows that recurrent packing arrangements exist in topologically different structures. For example, the three-layer sandwich domain architecture may be divided into four structural subclasses based on internal packing arrangements. These subclasses represent an intermediate level of structure classification, more general than topology, but more specific than architecture as defined in CATH. A strategy is presented for developing a set of predictive hidden Markov models based on multiple SCALI alignments.</jats:p><jats:p>Availability: An online topology-independent SCALI structure comparison server is available at http://www.bioinfo.rpi.edu/~bystrc/scali.html</jats:p><jats:p>Contact:  bystrc@rpi.edu</jats:p>
Umfang: 1010-1019
ISSN: 1367-4811
1367-4803
DOI: 10.1093/bioinformatics/bti128