Protein length distribution is remarkably uniform across the tree of life.

Details

Serval ID
serval:BIB_2F921C9DBF4A
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
Protein length distribution is remarkably uniform across the tree of life.
Journal
Genome biology
Author(s)
Nevers Y., Glover N.M., Dessimoz C., Lecompte O.
ISSN
1474-760X (Electronic)
ISSN-L
1474-7596
Publication state
Published
Issued date
08/06/2023
Peer-reviewed
Oui
Volume
24
Number
1
Pages
135
Language
english
Notes
Publication types: Journal Article
Publication Status: epublish
Abstract
In every living species, the function of a protein depends on its organization of structural domains, and the length of a protein is a direct reflection of this. Because every species evolved under different evolutionary pressures, the protein length distribution, much like other genomic features, is expected to vary across species but has so far been scarcely studied.
Here we evaluate this diversity by comparing protein length distribution across 2326 species (1688 bacteria, 153 archaea, and 485 eukaryotes). We find that proteins tend to be on average slightly longer in eukaryotes than in bacteria or archaea, but that the variation of length distribution across species is low, especially compared to the variation of other genomic features (genome size, number of proteins, gene length, GC content, isoelectric points of proteins). Moreover, most cases of atypical protein length distribution appear to be due to artifactual gene annotation, suggesting the actual variation of protein length distribution across species is even smaller.
These results open the way for developing a genome annotation quality metric based on protein length distribution to complement conventional quality measures. Overall, our findings show that protein length distribution between living species is more uniform than previously thought. Furthermore, we also provide evidence for a universal selection on protein length, yet its mechanism and fitness effect remain intriguing open questions.
Keywords
Genomics/methods, Archaea/genetics, Bacteria/genetics, Genome, Eukaryota/genetics, Phylogeny, Evolution, Molecular, Comparative genomics, Genome annotation, Genome evolution, Protein length
Pubmed
Open Access
Yes
Create date
14/06/2023 10:52
Last modification date
24/11/2023 8:14
Usage data