Large-Scale Comparative Analysis of Codon Models Accounting for Protein and Nucleotide Selection.
Détails
Télécharger: 30847475_BIB_FF45CD1C1260.pdf (589.33 [Ko])
Etat: Public
Version: Final published version
Licence: CC BY 4.0
Etat: Public
Version: Final published version
Licence: CC BY 4.0
ID Serval
serval:BIB_FF45CD1C1260
Type
Article: article d'un périodique ou d'un magazine.
Collection
Publications
Institution
Titre
Large-Scale Comparative Analysis of Codon Models Accounting for Protein and Nucleotide Selection.
Périodique
Molecular biology and evolution
ISSN
1537-1719 (Electronic)
ISSN-L
0737-4038
Statut éditorial
Publié
Date de publication
01/06/2019
Peer-reviewed
Oui
Volume
36
Numéro
6
Pages
1316-1332
Langue
anglais
Notes
Publication types: Comparative Study ; Evaluation Studies ; Journal Article ; Research Support, Non-U.S. Gov't
Publication Status: ppublish
Publication Status: ppublish
Résumé
There are numerous sources of variation in the rate of synonymous substitutions inside genes, such as direct selection on the nucleotide sequence, or mutation rate variation. Yet scans for positive selection rely on codon models which incorporate an assumption of effectively neutral synonymous substitution rate, constant between sites of each gene. Here we perform a large-scale comparison of approaches which incorporate codon substitution rate variation and propose our own simple yet effective modification of existing models. We find strong effects of substitution rate variation on positive selection inference. More than 70% of the genes detected by the classical branch-site model are presumably false positives caused by the incorrect assumption of uniform synonymous substitution rate. We propose a new model which is strongly favored by the data while remaining computationally tractable. With the new model we can capture signatures of nucleotide level selection acting on translation initiation and on splicing sites within the coding region. Finally, we show that rate variation is highest in the highly recombining regions, and we propose that recombination and mutation rate variation, such as high CpG mutation rate, are the two main sources of nucleotide rate variation. Although we detect fewer genes under positive selection in Drosophila than without rate variation, the genes which we detect contain a stronger signal of adaptation of dynein, which could be associated with Wolbachia infection. We provide software to perform positive selection analysis using the new model.
Mots-clé
Animals, Codon, Computer Simulation, Drosophila/genetics, Models, Genetic, Mutation Rate, Recombination, Genetic, Selection, Genetic, Silent Mutation, Vertebrates/genetics, codon models, positive selection, substitution rate variation, synonymous substitutions
Pubmed
Web of science
Open Access
Oui
Création de la notice
04/03/2019 15:26
Dernière modification de la notice
15/01/2021 7:12