Quantum Circuit Structure Optimization for Quantum Reinforcement Learning

๐Ÿ“… 2025-07-01
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
In quantum reinforcement learning (QRL), parameterized quantum circuits (PQCs) are typically designed heuristically, lacking theoretical guarantees of optimality and hindering efficient high-dimensional policy learning. Method: We propose QRL-NASโ€”the first differentiable quantum neural architecture search (QNAST) framework tailored for QRLโ€”that jointly optimizes quantum gate selection and placement to enable end-to-end automated discovery of PQC architectures. Integrating quantum neural networks with reinforcement learning, it leverages the linear and nonlinear transformation capabilities of quantum gates and employs gradient-based architecture search for efficient exploration of the quantum circuit space. Contribution/Results: Experiments across multiple benchmark tasks demonstrate that QRL-NAS significantly improves cumulative reward (average +23.6%) over fixed-architecture baselines, validating its effectiveness and generalizability. This work overcomes the heuristic-design bottleneck and establishes a new paradigm for scalable, high-performance QRL.

Technology Category

Application Category

๐Ÿ“ Abstract
Reinforcement learning (RL) enables agents to learn optimal policies through environmental interaction. However, RL suffers from reduced learning efficiency due to the curse of dimensionality in high-dimensional spaces. Quantum reinforcement learning (QRL) addresses this issue by leveraging superposition and entanglement in quantum computing, allowing efficient handling of high-dimensional problems with fewer resources. QRL combines quantum neural networks (QNNs) with RL, where the parameterized quantum circuit (PQC) acts as the core computational module. The PQC performs linear and nonlinear transformations through gate operations, similar to hidden layers in classical neural networks. Previous QRL studies, however, have used fixed PQC structures based on empirical intuition without verifying their optimality. This paper proposes a QRL-NAS algorithm that integrates quantum neural architecture search (QNAS) to optimize PQC structures within QRL. Experiments demonstrate that QRL-NAS achieves higher rewards than QRL with fixed circuits, validating its effectiveness and practical utility.
Problem

Research questions and friction points this paper is trying to address.

Optimizing quantum circuit structures for quantum reinforcement learning
Addressing dimensionality curse in RL via quantum computing advantages
Automating PQC design to improve QRL performance and efficiency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Quantum neural architecture search optimizes circuits
Parameterized quantum circuits enhance reinforcement learning
Dynamic circuit structures improve learning efficiency
๐Ÿ”Ž Similar Papers
No similar papers found.
S
Seok Bin Son
Department of Electrical and Computer Engineering, Korea University, Seoul, Republic of Korea
Joongheon Kim
Joongheon Kim
Korea University Professor (Electrical Engineering) | USC PhD (Computer Science)
MobilityQuantum Machine LearningQuantum Deep LearningQuantum AI