The branch-site test of positive selection is surprisingly robust but lacks power under synonymous substitution saturation and variation in GC.

Details

Ressource 1Download: BIB_E31E37CD7B01.P001.pdf (1027.81 [Ko])
State: Public
Version: author
Serval ID
serval:BIB_E31E37CD7B01
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
The branch-site test of positive selection is surprisingly robust but lacks power under synonymous substitution saturation and variation in GC.
Journal
Molecular Biology and Evolution
Author(s)
Gharib W.H., Robinson-Rechavi M.
ISSN
1537-1719 (Electronic)
ISSN-L
0737-4038
Publication state
Published
Issued date
2013
Peer-reviewed
Oui
Volume
30
Number
7
Pages
1675-1686
Language
english
Notes
Publication types: Journal Article ; Research Support, Non-U.S. Gov't
Publication Status: ppublish
Abstract
Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the ω ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here, we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of underestimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, underestimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus, high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to ∼80%) and low GC (∼30%) genes. Moreover, neither shifts of GC content on a specific branch nor major shifts in GC along the gene sequence generate many false positives. Our results confirm that the branch-site is a very conservative test.
Keywords
Amino Acid Substitution/genetics, Base Composition/genetics, Codon/genetics, Computer Simulation, Evolution, Molecular, Models, Genetic, Open Reading Frames/genetics, Phylogeny, Selection, Genetic
Pubmed
Web of science
Open Access
Yes
Create date
25/03/2013 9:45
Last modification date
20/08/2019 17:06
Usage data