Double-digest RAD-sequencing: do pre- and post-sequencing protocol parameters impact biological results?

Details

Serval ID
serval:BIB_D2F6F94CB663
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
Double-digest RAD-sequencing: do pre- and post-sequencing protocol parameters impact biological results?
Journal
Molecular genetics and genomics
Author(s)
Cumer T., Pouchon C., Boyer F., Yannic G., Rioux D., Bonin A., Capblancq T.
ISSN
1617-4623 (Electronic)
ISSN-L
1617-4623
Publication state
Published
Issued date
03/2021
Peer-reviewed
Oui
Volume
296
Number
2
Pages
457-471
Language
english
Notes
Publication types: Journal Article
Publication Status: ppublish
Abstract
Next-generation sequencing technologies have opened a new era of research in population genetics. Following these new sequencing opportunities, the use of restriction enzyme-based genotyping techniques, such as restriction site-associated DNA sequencing (RAD-seq) or double-digest RAD-sequencing (ddRAD-seq), has dramatically increased in the last decade. From DNA sampling to SNP calling, the laboratory and bioinformatic parameters of enzyme-based techniques have been investigated in the literature. However, the impact of those parameters on downstream analyses and biological results remains less documented. In this study, we investigated the effects of sevral pre- and post-sequencing settings on ddRAD-seq results for two biological systems: a complex of butterfly species (Coenonympha sp.) and several populations of common beech (Fagus sylvatica). Our results suggest that pre-sequencing parameters (i.e., DNA quantity, number of PCR cycles during library preparation) have a significant impact on the number of recovered reads and SNPs, on the number of unique alleles and on individual heterozygosity. In the same way, we found that post-sequencing settings (i.e., clustering and minimum coverage thresholds) influenced loci reconstruction (e.g., number of loci, mean coverage) and SNP calling (e.g., number of SNPs; heterozygosity) but had only a marginal impact on downstream analyses (e.g., measure of genetic differentiation, estimation of individual admixture, and demographic inferences). In addition, replication analyses confirmed the reproducibility of the ddRAD-seq procedure. Overall, this study assesses the degree of sensitivity of ddRAD-seq data to pre- and post-sequencing protocols, and illustrates its robustness when studying population genetics.
Keywords
Alleles, Animals, Butterflies/genetics, Computational Biology/methods, DNA Restriction Enzymes/metabolism, Fagus/genetics, Genetics, Population, High-Throughput Nucleotide Sequencing/methods, Polymorphism, Single Nucleotide, Reproducibility of Results, Sequence Analysis, DNA/methods, Bioinformatics, Laboratory protocol, Molecular biology, Next-generation sequencing
Pubmed
Web of science
Create date
26/01/2021 13:11
Last modification date
23/12/2023 8:06
Usage data