Enhanced detection of RNA modifications and read mapping with high-accuracy nanopore RNA basecalling models.

Détails

ID Serval
serval:BIB_5FBE407DE554
Type
Article: article d'un périodique ou d'un magazine.
Collection
Publications
Institution
Titre
Enhanced detection of RNA modifications and read mapping with high-accuracy nanopore RNA basecalling models.
Périodique
Genome research
Auteur⸱e⸱s
Diensthuber G., Pryszcz L.P., Llovera L., Lucas M.C., Delgado-Tejedor A., Cruciani S., Roignant J.Y., Begik O., Novoa E.M.
ISSN
1549-5469 (Electronic)
ISSN-L
1088-9051
Statut éditorial
Publié
Date de publication
20/11/2024
Peer-reviewed
Oui
Volume
34
Numéro
11
Pages
1865-1877
Langue
anglais
Notes
Publication types: Journal Article
Publication Status: epublish
Résumé
In recent years, nanopore direct RNA sequencing (DRS) became a valuable tool for studying the epitranscriptome, owing to its ability to detect multiple modifications within the same full-length native RNA molecules. Although RNA modifications can be identified in the form of systematic basecalling "errors" in DRS data sets, N6-methyladenosine (m <sup>6</sup> A) modifications produce relatively low "errors" compared with other RNA modifications, limiting the applicability of this approach to m <sup>6</sup> A sites that are modified at high stoichiometries. Here, we demonstrate that the use of alternative RNA basecalling models, trained with fully unmodified sequences, increases the "error" signal of m <sup>6</sup> A, leading to enhanced detection and improved sensitivity even at low stoichiometries. Moreover, we find that high-accuracy alternative RNA basecalling models can show up to 97% median basecalling accuracy, outperforming currently available RNA basecalling models, which show 91% median basecalling accuracy. Notably, the use of high-accuracy basecalling models is accompanied by a significant increase in the number of mapped reads-especially in shorter RNA fractions-and increased basecalling error signatures at pseudouridine (Ψ)- and N1-methylpseudouridine (m <sup>1</sup> Ψ)-modified sites. Overall, our work demonstrates that alternative RNA basecalling models can be used to improve the detection of RNA modifications, read mappability, and basecalling accuracy in nanopore DRS data sets.
Mots-clé
RNA/genetics, Adenosine/analogs & derivatives, RNA Processing, Post-Transcriptional, Humans, Nanopore Sequencing/methods, Sequence Analysis, RNA/methods, Nanopores
Pubmed
Création de la notice
20/09/2024 14:32
Dernière modification de la notice
22/11/2024 17:55
Données d'usage