Early View | Published Version Download ( PDF | 2MB) | License: Creative Commons Attribution Non-commercial 4.0 |
Evidence for the preferential reuse of sub‐domain motifs in primordial protein folds
Heizinger, Leonhard and Merkl, Rainer
(2021)
Evidence for the preferential reuse of sub‐domain motifs in primordial protein folds.
Proteins: Structure, Function, and Bioinformatics 89, pp. 1167-1179.
Date of publication of this fulltext: 19 May 2021 05:43
Article
DOI to cite this document: 10.5283/epub.45812
Abstract
A comparison of protein backbones makes clear that not more than approximately 1400 different folds exist, each specifying the three-dimensional topology of a protein domain. Large proteins are composed of specific domain combinations and many domains can accommodate different functions. These findings confirm that the reuse of domains is key for the evolution of multi-domain proteins. If reuse ...
A comparison of protein backbones makes clear that not more than approximately 1400 different folds exist, each specifying the three-dimensional topology of a protein domain. Large proteins are composed of specific domain combinations and many domains can accommodate different functions. These findings confirm that the reuse of domains is key for the evolution of multi-domain proteins. If reuse was also the driving force for domain evolution, ancestral fragments of sub-domain size exist that are shared between domains possessing significantly different topologies. For the fully automated detection of putatively ancestral motifs, we developed the algorithm Fragstatt that compares proteins pairwise to identify fragments, that is, instantiations of the same motif. To reach maximal sensitivity, Fragstatt compares sequences by means of cascaded alignments of profile Hidden Markov Models. If the fragment sequences are sufficiently similar, the program determines and scores the structural concordance of the fragments. By analyzing a comprehensive set of proteins from the CATH database, Fragstatt identified 12 532 partially overlapping and structurally similar motifs that clustered to 134 unique motifs. The dissemination of these motifs is limited: We found only two domain topologies that contain two different motifs and generally, these motifs occur in not more than 18% of the CATH topologies. Interestingly, motifs are enriched in topologies that are considered ancestral. Thus, our findings suggest that the reuse of sub-domain sized fragments was relevant in early phases of protein evolution and became less important later on.
Alternative links to fulltext
Involved Institutions
Details
| Item type | Article | ||||
| Journal or Publication Title | Proteins: Structure, Function, and Bioinformatics | ||||
| Publisher: | Wiley | ||||
|---|---|---|---|---|---|
| Place of Publication: | HOBOKEN | ||||
| Volume: | 89 | ||||
| Page Range: | pp. 1167-1179 | ||||
| Date | 6 May 2021 | ||||
| Institutions | Biology, Preclinical Medicine > Institut für Biophysik und physikalische Biochemie > Prof. Dr. Rainer Merkl | ||||
| Identification Number |
| ||||
| Keywords | EVOLUTION; DATABASE; CLASSIFICATION; IDENTIFICATION; SEQUENCES; SYMMETRY; MODEL; ancient modules; domain evolution; fold space; protein evolution | ||||
| Dewey Decimal Classification | 500 Science > 570 Life sciences | ||||
| Status | Published | ||||
| Refereed | Yes, this version has been refereed | ||||
| Created at the University of Regensburg | Yes | ||||
| URN of the UB Regensburg | urn:nbn:de:bvb:355-epub-458125 | ||||
| Item ID | 45812 |
Download Statistics
Download Statistics