Exponential quantum advantage in processing massive classical data

πŸ“… 2026-04-08
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenge of achieving provable, universal quantum advantage in processing large-scale classical data, overcoming exponential bottlenecks faced by classical methods in model size, sample complexity, and runtime. The authors propose a quantum oracle sketching technique that integrates classical shadows with a random-sample-based quantum superposition access model. Remarkably, this approach enables online large-scale classification and dimensionality reduction using only a small-scale quantum computer requiring fewer than 60 logical qubitsβ€”i.e., polylogarithmic resources. The method establishes, for the first time, an exponential quantum advantage in classical-data machine learning that holds rigorously even under the assumption that BPP = BQP or when granting classical algorithms unlimited computational time. Empirical evaluations on real-world tasks, including single-cell RNA sequencing and movie review sentiment analysis, demonstrate a 4–6 orders-of-magnitude reduction in model size compared to classical counterparts while maintaining comparable predictive performance.
πŸ“ Abstract
Broadly applicable quantum advantage, particularly in classical data processing and machine learning, has been a fundamental open problem. In this work, we prove that a small quantum computer of polylogarithmic size can perform large-scale classification and dimension reduction on massive classical data by processing samples on the fly, whereas any classical machine achieving the same prediction performance requires exponentially larger size. Furthermore, classical machines that are exponentially larger yet below the required size need superpolynomially more samples and time. We validate these quantum advantages in real-world applications, including single-cell RNA sequencing and movie review sentiment analysis, demonstrating four to six orders of magnitude reduction in size with fewer than 60 logical qubits. These quantum advantages are enabled by quantum oracle sketching, an algorithm for accessing the classical world in quantum superposition using only random classical data samples. Combined with classical shadows, our algorithm circumvents the data loading and readout bottleneck to construct succinct classical models from massive classical data, a task provably impossible for any classical machine that is not exponentially larger than the quantum machine. These quantum advantages persist even when classical machines are granted unlimited time or if BPP=BQP, and rely only on the correctness of quantum mechanics. Together, our results establish machine learning on classical data as a broad and natural domain of quantum advantage and a fundamental test of quantum mechanics at the complexity frontier.
Problem

Research questions and friction points this paper is trying to address.

quantum advantage
classical data processing
machine learning
exponential separation
quantum oracle sketching
Innovation

Methods, ideas, or system contributions that make the work stand out.

quantum oracle sketching
exponential quantum advantage
classical shadows
polylogarithmic quantum circuits
classical data processing
πŸ”Ž Similar Papers