DiscoSnp-RAD: de novo detection of small variants for RAD-Seq population genomics.

Détails

Ressource 1Télécharger: peerj-9291.pdf (2248.35 [Ko])
Etat: Public
Version: Final published version
Licence: CC BY 4.0
Document(s) secondaire(s)
Télécharger: Supplementary_materials.pdf (488.40 [Ko])
Etat: Public
Version: Supplementary document
Licence: Non spécifiée
ID Serval
serval:BIB_C5B8525DC34D
Type
Article: article d'un périodique ou d'un magazine.
Collection
Publications
Institution
Titre
DiscoSnp-RAD: de novo detection of small variants for RAD-Seq population genomics.
Périodique
PeerJ
Auteur⸱e⸱s
Gauthier J., Mouden C., Suchan T., Alvarez N., Arrigo N., Riou C., Lemaitre C., Peterlongo P.
ISSN
2167-8359 (Print)
ISSN-L
2167-8359
Statut éditorial
Publié
Date de publication
2020
Peer-reviewed
Oui
Volume
8
Pages
e9291
Langue
anglais
Notes
Publication types: Journal Article
Publication Status: epublish
Résumé
Restriction site Associated DNA Sequencing (RAD-Seq) is a technique characterized by the sequencing of specific loci along the genome that is widely employed in the field of evolutionary biology since it allows to exploit variants (mainly Single Nucleotide Polymorphism-SNPs) information from entire populations at a reduced cost. Common RAD dedicated tools, such as STACKS or IPyRAD, are based on all-vs-all read alignments, which require consequent time and computing resources. We present an original method, DiscoSnp-RAD, that avoids this pitfall since variants are detected by exploiting specific parts of the assembly graph built from the reads, hence preventing all-vs-all read alignments. We tested the implementation on simulated datasets of increasing size, up to 1,000 samples, and on real RAD-Seq data from 259 specimens of Chiastocheta flies, morphologically assigned to seven species. All individuals were successfully assigned to their species using both STRUCTURE and Maximum Likelihood phylogenetic reconstruction. Moreover, identified variants succeeded to reveal a within-species genetic structure linked to the geographic distribution. Furthermore, our results show that DiscoSnp-RAD is significantly faster than state-of-the-art tools. The overall results show that DiscoSnp-RAD is suitable to identify variants from RAD-Seq data, it does not require time-consuming parameterization steps and it stands out from other tools due to its completely different principle, making it substantially faster, in particular on large datasets.
Mots-clé
Deletions, Insertions, RAD-seq, Reference-free, SNPs, Variants
Pubmed
Web of science
Open Access
Oui
Création de la notice
03/07/2020 18:46
Dernière modification de la notice
13/01/2024 7:22
Données d'usage