Repeat Detector: versatile sizing of expanded tandem repeats and identification of interrupted alleles from targeted DNA sequencing.

Details

Ressource 1Download: RepeatDetector.pdf (981.56 [Ko])
State: Public
Version: Final published version
License: CC BY 4.0
Serval ID
serval:BIB_16ADA4432905
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
Repeat Detector: versatile sizing of expanded tandem repeats and identification of interrupted alleles from targeted DNA sequencing.
Journal
NAR genomics and bioinformatics
Author(s)
Taylor A.S., Barros D., Gobet N., Schuepbach T., McAllister B., Aeschbach L., Randall E.L., Trofimenko E., Heuchan E.R., Barszcz P., Ciosi M., Morgan J., Hafford-Tear N.J., Davidson A.E., Massey T.H., Monckton D.G., Jones L., Network RIOTEHD, Xenarios I., Dion V.
ISSN
2631-9268 (Electronic)
ISSN-L
2631-9268
Publication state
Published
Issued date
12/2022
Peer-reviewed
Oui
Volume
4
Number
4
Pages
lqac089
Language
english
Notes
Publication types: Journal Article
Publication Status: epublish
Abstract
Targeted DNA sequencing approaches will improve how the size of short tandem repeats is measured for diagnostic tests and preclinical studies. The expansion of these sequences causes dozens of disorders, with longer tracts generally leading to a more severe disease. Interrupted alleles are sometimes present within repeats and can alter disease manifestation. Determining repeat size mosaicism and identifying interruptions in targeted sequencing datasets remains a major challenge. This is in part because standard alignment tools are ill-suited for repetitive and unstable sequences. To address this, we have developed Repeat Detector (RD), a deterministic profile weighting algorithm for counting repeats in targeted sequencing data. We tested RD using blood-derived DNA samples from Huntington's disease and Fuchs endothelial corneal dystrophy patients sequenced using either Illumina MiSeq or Pacific Biosciences single-molecule, real-time sequencing platforms. RD was highly accurate in determining repeat sizes of 609 blood-derived samples from Huntington's disease individuals and did not require prior knowledge of the flanking sequences. Furthermore, RD can be used to identify alleles with interruptions and provide a measure of repeat instability within an individual. RD is therefore highly versatile and may find applications in the diagnosis of expanded repeat disorders and in the development of novel therapies.
Keywords
tandem repeats diseases, sequencing, detection software
Pubmed
Web of science
Open Access
Yes
Create date
19/12/2022 12:50
Last modification date
19/07/2023 7:08
Usage data