Quantum Hierarchical Reinforcement Learning via Variational Quantum Circuits

๐Ÿ“… 2026-05-05
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

216K/year
๐Ÿค– AI Summary
This work explores the potential of quantum computing in hierarchical reinforcement learning to enhance performance and parameter efficiency while addressing the bottleneck in option-value estimation. Building upon the options-critic framework, we introduce variational quantum circuits for the first time in a systematic manner, constructing a quantum-classical hybrid agent that employs quantum circuits for feature extraction, option-value functions, termination functions, and intra-option policies. The proposed approach significantly reduces the number of trainable parametersโ€”by up to 66%โ€”and outperforms classical baselines on standard benchmarks. Furthermore, our analysis identifies key design factors in quantum circuits that critically influence performance, thereby establishing a new paradigm for efficient hybrid hierarchical reinforcement learning.
๐Ÿ“ Abstract
Reinforcement learning is one of the most challenging learning paradigms where efficacy and efficiency gains are extremely valuable. Hierarchical reinforcement learning is a variant that leverages temporal abstraction to structure decision-making. While parametrized quantum computations have shown success in non-hierarchical reinforcement learning, whether these advantages adapt to hierarchical decision-making remains a critical open question. In this work, we develop a hybrid hierarchical agent based on the option-critic architecture. This hybrid agent substitutes classical components with variational quantum circuits for feature extractors, option-value functions, termination functions, and intra-option policies. Evaluated on standard benchmarking environments, results show that a hybrid agent utilizing a quantum feature extractor can outperform classical baselines while saving up to 66\% trainable parameters. We also identify an architectural bottleneck that quantum option-value estimation severely degrades performance. Further ablation studies reveal how architectural choices of the quantum circuits affect performance. Our work establishes design principles for parameter-efficient hybrid hierarchical agents.
Problem

Research questions and friction points this paper is trying to address.

Hierarchical Reinforcement Learning
Quantum Computing
Variational Quantum Circuits
Temporal Abstraction
Option-Critic Architecture
Innovation

Methods, ideas, or system contributions that make the work stand out.

Variational Quantum Circuits
Hierarchical Reinforcement Learning
Quantum Feature Extractor
Option-Critic Architecture
Parameter Efficiency
๐Ÿ”Ž Similar Papers
2023-11-09International Conference on Agents and Artificial IntelligenceCitations: 3