The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data.

Details

Ressource 1Download: 26147798_BIB_7A3E586E90E2.pdf (307.02 [Ko])
State: Public
Version: Final published version
License: CC BY 4.0
Serval ID
serval:BIB_7A3E586E90E2
Type
Article: article from journal or magazin.
Collection
Publications
Title
The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data.
Journal
PloS one
Author(s)
Ferrarini Alberto, Xumerle Luciano, Griggio Francesca, Garonzi Marianna, Cantaloni Chiara, Centomo Cesare, Vargas Sergio Marin, Descombes Patrick, Marquis Julien, Collino Sebastiano, Franceschi Claudio, Garagnani Paolo, Salisbury Benjamin A., Harvey John Max, Delledonne Massimo
Publication state
Published
Issued date
2015
Volume
10
Number
7
Pages
e0132180
Language
english
Abstract
Genetic testing, which is now a routine part of clinical practice and disease management protocols, is often based on the assessment of small panels of variants or genes. On the other hand, continuous improvements in the speed and per-base costs of sequencing have now made whole exome sequencing (WES) and whole genome sequencing (WGS) viable strategies for targeted or complete genetic analysis, respectively. Standard WGS/WES data analytical workflows generally rely on calling of sequence variants respect to the reference genome sequence. However, the reference genome sequence contains a large number of sites represented by rare alleles, by known pathogenic alleles and by alleles strongly associated to disease by GWAS. It’s thus critical, for clinical applications of WGS and WES, to interpret whether non-variant sites are homozygous for the reference allele or if the corresponding genotype cannot be reliably called. Here we show that an alternative analytical approach based on the analysis of both variant and non-variant sites from WGS data allows to genotype more than 92% of sites corresponding to known SNPs compared to 6% genotyped by standard variant analysis. These include homozygous reference sites of clinical interest, thus leading to a broad and comprehensive characterization of variation necessary to an accurate evaluation of disease risk. Altogether, our findings indicate that characterization of both variant and non-variant clinically informative sites in the genome is necessary to allow an accurate clinical assessment of a personal genome. Finally, we propose a highly efficient extended VCF (eVCF) file format which allows to store genotype calls for sites of clinical interest while remaining compatible with current variant interpretation software.
Keywords
Humans, Female, Male, Exome, *Polymorphism, Single Nucleotide, High-Throughput Nucleotide Sequencing, *Alleles, *Genome-Wide Association Study, *Genome, Human, *Homozygote, Long QT Syndrome/*genetics
Pubmed
Create date
19/02/2020 12:23
Last modification date
29/04/2021 12:34
Usage data