Noise-immune and AI-enhanced DNA storage via adaptive partition mapping of digital data

📅 2026-01-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the vulnerability of DNA-based data storage to noise during synthesis, preservation, and sequencing, where conventional error-correcting codes fail when errors exceed predefined thresholds. To overcome this limitation, the authors propose a Partitioned Mapping with Jump-and-Rotate (PJ) encoding scheme that eliminates inter-strand dependencies, thereby transforming strand loss into localized information gaps amenable to AI-driven inference for controlled recovery. This approach establishes the first universal DNA storage framework that does not require prior knowledge of error probabilities and enables successful file decoding under arbitrary strand loss rates, with information fidelity degrading gracefully as damage increases. Experimental results demonstrate robust data recovery under extreme conditions—including 10% strand loss, accelerated aging, and high-intensity X-ray irradiation—while preserving the classification performance of machine learning datasets, significantly enhancing storage robustness and fault tolerance.

Technology Category

Application Category

📝 Abstract
Encoding digital information into DNA sequences offers an attractive potential solution for storing rapidly growing data under the information age and the rise of artificial intelligence. However, practical implementations of DNA storage are constrained by errors introduced during synthesis, preservation, and sequencing processes, and traditional error-correcting codes remain vulnerable to noise levels that exceed predefined thresholds. Here, we developed a Partitioning-mapping with Jump-rotating (PJ) encoding scheme, which exhibits exceptional noise resilience. PJ removes cross-strand information dependencies so that strand loss manifests as localized gaps rather than catastrophic file failure. It prioritizes file decodability under arbitrary noise conditions and leverages AI-based inference to enable controllable recovery of digital information. For the intra-strand encoding, we develop a jump-rotating strategy that relaxes sequence constraints relative to conventional rotating codes and provides tunable information density via an adjustable jump length. Based on this encoding architecture, the original file information can always be decoded and recovered under any strand loss ratio, with fidelity degrading smoothly as damage increases. We demonstrate that original files can be effectively recovered even with 10% strand loss, and machine learning datasets stored under these conditions retain their classification performance. Experiments further confirmed that PJ successfully decodes image files after extreme environmental disturbance using accelerated aging and high-intensity X-ray irradiation. By eliminating reliance on prior error probabilities, PJ establishes a general framework for robust, archival DNA storage capable of withstanding the rigorous conditions of real-world preservation.
Problem

Research questions and friction points this paper is trying to address.

DNA storage
noise resilience
error correction
strand loss
data recovery
Innovation

Methods, ideas, or system contributions that make the work stand out.

DNA data storage
noise-resilient encoding
adaptive partition mapping
AI-enhanced recovery
jump-rotating code
🔎 Similar Papers
No similar papers found.
Z
Zimu Li
State Key Laboratory of Synergistic Chem-Bio Synthesis, School of Chemistry and Chemical Engineering, Frontiers Science Center for Transformative Molecules, Institute of Translational Medicine, Shanghai Jiao Tong University, Shanghai 200240, China
Bingyi Liu
Bingyi Liu
Professor, Department of CS and AI, Wuhan University of Technology
Internet of VehiclesEdge ComputingAutonomous VehiclesIntelligent Transportation Systems
Lei Zhao
Lei Zhao
Shanghai Jiao Tong University
OptimizationMachine Learning
Qian Zhang
Qian Zhang
Hong Kong University of Science and Technology
wireless networkingIoTSmart HealthcareCognitive radio networks and Dynamic Spectrum Managementmultimedia networking
Y
Yang Liu
State Key Laboratory of Synergistic Chem-Bio Synthesis, School of Chemistry and Chemical Engineering, Frontiers Science Center for Transformative Molecules, Institute of Translational Medicine, Shanghai Jiao Tong University, Shanghai 200240, China
J
Jun Liu
Lenovo Research, Lenovo Group, Beijing 100094, China
Ke Ke
Ke Ke
Central Washington University
H
Huating Kong
Shanghai Synchrotron Radiation Facility (SSRF), Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, 201204, China
X
Xiaolei Zuo
Institute of Molecular Medicine, Shanghai Key Laboratory for Nucleic Acids Chemistry and Nanomedicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China
Chunhai Fan
Chunhai Fan
Shanghai Jiao Tong University
Nucleic acids chemistryDNA nanotechnologyBioimaging and biosensors
F
Fei Wang
State Key Laboratory of Synergistic Chem-Bio Synthesis, School of Chemistry and Chemical Engineering, Frontiers Science Center for Transformative Molecules, Institute of Translational Medicine, Shanghai Jiao Tong University, Shanghai 200240, China