Path Signatures Enable Model-Free Mapping of RNA Modifications

📅 2025-11-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
RNA modification detection is hindered by reverse transcription biases and the reliance of nanopore sequencing tools on labeled training data. To address these limitations, we propose an unsupervised, training-free anomaly detection method: raw nanopore direct RNA sequencing current signals are transformed via path signatures to extract modification-sensitive features; modification sites are then localized through nearest-neighbor comparison and statistical significance testing—enabling robust, cross-RNA-type and cross-chemistry detection. This work introduces path signatures to epitranscriptomics for the first time, eliminating dependence on prior knowledge of modifications or large-scale annotated datasets. The method accurately recapitulates multiple known modifications in *E. coli* rRNA and discovers a novel, qRT-PCR-validated 2′-O-methylation site in dengue virus sfRNA, demonstrating high sensitivity and cross-species applicability.

Technology Category

Application Category

📝 Abstract
Detecting chemical modifications on RNA molecules remains a key challenge in epitranscriptomics. Traditional reverse transcription-based sequencing methods introduce enzyme- and sequence-dependent biases and fragment RNA molecules, confounding the accurate mapping of modifications across the transcriptome. Nanopore direct RNA sequencing offers a powerful alternative by preserving native RNA molecules, enabling the detection of modifications at single-molecule resolution. However, current computational tools can identify only a limited subset of modification types within well-characterized sequence contexts for which ample training data exists. Here, we introduce a model-free computational method that reframes modification detection as an anomaly detection problem, requiring only canonical (unmodified) RNA reads without any other annotated data. For each nanopore read, our approach extracts robust, modification-sensitive features from the raw ionic current signal at a site using the signature transform, then computes an anomaly score by comparing the resulting feature vector to its nearest neighbors in an unmodified reference dataset. We convert anomaly scores into statistical p-values to enable anomaly detection at both individual read and site levels. Validation on densely-modified extit{E. coli} rRNA demonstrates that our approach detects known sites harboring diverse modification types, without prior training on these modifications. We further applyied this framework to dengue virus (DENV) transcripts and mammalian mRNAs. For DENV sfRNA, it led to revealing a novel 2'-O-methylated site, which we validate orthogonally by qRT-PCR assays. These results demonstrate that our model-free approach operates robustly across different types of RNAs and datasets generated with different nanopore sequencing chemistries.
Problem

Research questions and friction points this paper is trying to address.

Detecting RNA modifications without enzyme-dependent biases in sequencing
Overcoming limitations of current tools requiring extensive training data
Developing model-free anomaly detection using canonical RNA reference data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Model-free anomaly detection for RNA modifications
Signature transform extracts features from nanopore signals
Statistical p-values enable modification detection without training
🔎 Similar Papers
No similar papers found.
Maud Lemercier
Maud Lemercier
University of Oxford
P
Paola Arrubarrena
Department of Mathematics, Imperial College London, London, UK.
S
Salvatore Di Giorgio
Division of Immune Diversity, German Cancer Research Center, Heidelberg 69120, Germany.
J
Julia Brettschneider
Department of Statistics, University of Warwick, Coventry, UK.
Thomas Cass
Thomas Cass
Professor of Mathematics, Imperial College London
Probability Theory and Stochastic Analysis
I
Isabel S. Naarmann-de Vries
Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg University, Medical Faculty Heidelberg, 69120 Heidelberg, Germany.
A
Anastasia Papavasiliou
Department of Statistics, University of Warwick, Coventry, UK.
A
Alessia Ruggieri
Department of Infectious Diseases, Molecular Virology, Center for Integrative Infectious Disease Research, Heidelberg University, Medical Faculty Heidelberg, 69120 Heidelberg, Germany.
I
Irem Tellioglu
Division of Immune Diversity, German Cancer Research Center, Heidelberg 69120, Germany.
C
Chia Ching Wu
Department of Infectious Diseases, Molecular Virology, Center for Integrative Infectious Disease Research, Heidelberg University, Medical Faculty Heidelberg, 69120 Heidelberg, Germany.
F. Nina Papavasiliou
F. Nina Papavasiliou
Professor of Immunology, DKFZ
RNA editingtherapeutic base editingepitranscriptomicsDNA mutation
T
Terry Lyons
Mathematical Institute, University of Oxford, Oxford, UK.