Protein length distribution is remarkably uniform across the tree of life.

Details

Serval ID
serval:BIB_2F921C9DBF4A
Type
Article: article from a journal or magazine.
Collection
Publications
Institution
Title
Protein length distribution is remarkably uniform across the tree of life.
Journal
Genome biology
Authors
Nevers Y., Glover N.M., Dessimoz C., Lecompte O.
ISSN
1474-760X (Electronic)
ISSN-L
1474-7596
Editorial status
Published
Publication date
08/06/2023
Peer-reviewed
Yes
Volume
24
Issue
1
Pages
135
Language
English
Notes
Publication types: Journal Article
Publication Status: epublish
Abstract
In every living species, the function of a protein depends on its organization of structural domains, and the length of a protein is a direct reflection of this. Because every species evolved under different evolutionary pressures, the protein length distribution, much like other genomic features, is expected to vary across species but has so far been scarcely studied.
Here we evaluate this diversity by comparing protein length distribution across 2326 species (1688 bacteria, 153 archaea, and 485 eukaryotes). We find that proteins tend to be on average slightly longer in eukaryotes than in bacteria or archaea, but that the variation of length distribution across species is low, especially compared to the variation of other genomic features (genome size, number of proteins, gene length, GC content, isoelectric points of proteins). Moreover, most cases of atypical protein length distribution appear to be due to artifactual gene annotation, suggesting the actual variation of protein length distribution across species is even smaller.
These results open the way for developing a genome annotation quality metric based on protein length distribution to complement conventional quality measures. Overall, our findings show that protein length distribution between living species is more uniform than previously thought. Furthermore, we also provide evidence for a universal selection on protein length, yet its mechanism and fitness effect remain intriguing open questions.
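As a rough illustration of the kind of comparison the abstract describes, the sketch below computes a per-species median protein length from FASTA-formatted proteomes and then the coefficient of variation of those medians across species. It is not the authors' pipeline: the function names, the choice of the median as summary statistic, and the toy sequences are all assumptions made for this example.

```python
import statistics


def protein_lengths(fasta_text):
    """Return the length of every sequence in a FASTA-formatted string."""
    lengths = []
    current = None  # stays None until the first '>' header is seen
    for line in fasta_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if current is not None:
                lengths.append(current)
            current = 0
        elif current is not None:
            # Sequence lines may be wrapped, so accumulate their lengths.
            current += len(line)
    if current is not None:
        lengths.append(current)
    return lengths


def coefficient_of_variation(values):
    """Population standard deviation divided by the mean."""
    return statistics.pstdev(values) / statistics.mean(values)


# Toy proteomes (hypothetical sequences, not real data).
proteomes = {
    "species_a": ">p1\nMKTAYIAKQR\n>p2\nMKV\nLAA\n",
    "species_b": ">q1\nMSTNPKPQRK\n>q2\nMGSSHHHH\n",
}

# One summary value per species, then the spread of that value across species.
medians = {
    name: statistics.median(protein_lengths(text))
    for name, text in proteomes.items()
}
spread = coefficient_of_variation(list(medians.values()))
```

A low `spread` relative to the same statistic computed for, say, genome size would correspond to the abstract's claim that protein length varies little across species. A species whose median falls far from the others could be flagged for a closer look at its annotation, though any threshold for doing so would have to be chosen empirically.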
Keywords
Genomics/methods, Archaea/genetics, Bacteria/genetics, Genome, Eukaryota/genetics, Phylogeny, Evolution, Molecular, Comparative genomics, Genome annotation, Genome evolution, Protein length
Pubmed
Open Access
Yes
Record created
14/06/2023 10:52
Record last modified
24/11/2023 8:14