Reductive evolution of proteomes and protein structures
Författare
Summary, in English
The lengths of orthologous protein families in Eukarya are almost double the lengths found in Bacteria and Archaea. Here we examine protein structures in 745 genomes and show that protein length differences between superkingdoms arise as much shorter prokaryotic nondomain linker sequences. Eukaryotic, bacterial, and archaeal linkers are 250, 86, and 73 aa residues in length, respectively, whereas folded domain sequences are 281, 280, and 256 residues, respectively. Cryptic domains match linkers (P < 0.0001) with probabilities ranging between 0.022 and 0.042; accordingly, they do not affect length estimates significantly. Linker sequences support intermolecular binding within proteomes and they are probably enriched in intrinsically disordered regions as well. Reductively evolved linker sequence lengths in growth rate maximized cells should be proportional to proteome diversity. By using total in-frame coding capacity of a genome [i.e., coding sequence (CDS)] as a reliable measure of proteome diversity, we find linker lengths of prokaryotes clearly evolve in proportion to CDS values, whereas those of eukaryotes are more randomly larger than expected. Domain lengths scarcely change over the entire range of CDS values. Thus, the protein linkers of prokaryotes evolve reductively whereas those of eukaryotes do not.
Avdelning/ar
Publiceringsår
2011
Språk
Engelska
Sidor
11954-11958
Publikation/Tidskrift/Serie
Proceedings of the National Academy of Sciences
Volym
108
Issue
29
Dokumenttyp
Artikel i tidskrift
Förlag
National Academy of Sciences
Ämne
- Biological Sciences
Nyckelord
- protein domain
- evolutionary constraint
- intrinsic disorder
Status
Published
Forskningsgrupp
- Microbial Ecology
ISBN/ISSN/Övrigt
- ISSN: 1091-6490