The branch-site test of positive selection is surprisingly robust but lacks power under synonymous substitution saturation and variation in GC.

Détails

Ressource 1Télécharger: BIB_E31E37CD7B01.P001.pdf (1027.81 [Ko])
Etat: Public
Version: de l'auteur⸱e
ID Serval
serval:BIB_E31E37CD7B01
Type
Article: article d'un périodique ou d'un magazine.
Collection
Publications
Institution
Titre
The branch-site test of positive selection is surprisingly robust but lacks power under synonymous substitution saturation and variation in GC.
Périodique
Molecular Biology and Evolution
Auteur⸱e⸱s
Gharib W.H., Robinson-Rechavi M.
ISSN
1537-1719 (Electronic)
ISSN-L
0737-4038
Statut éditorial
Publié
Date de publication
2013
Peer-reviewed
Oui
Volume
30
Numéro
7
Pages
1675-1686
Langue
anglais
Notes
Publication types: Journal Article ; Research Support, Non-U.S. Gov't
Publication Status: ppublish
Résumé
Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the ω ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here, we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of underestimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, underestimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus, high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to ∼80%) and low GC (∼30%) genes. Moreover, neither shifts of GC content on a specific branch nor major shifts in GC along the gene sequence generate many false positives. Our results confirm that the branch-site is a very conservative test.
Mots-clé
Amino Acid Substitution/genetics, Base Composition/genetics, Codon/genetics, Computer Simulation, Evolution, Molecular, Models, Genetic, Open Reading Frames/genetics, Phylogeny, Selection, Genetic
Pubmed
Web of science
Open Access
Oui
Création de la notice
25/03/2013 8:45
Dernière modification de la notice
20/08/2019 16:06
Données d'usage