Multi-User Beamforming with Deep Reinforcement Learning in Sensing-Aided Communication

📅 2025-05-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address beam misalignment caused by user mobility in millimeter-wave (mmWave) mobile communications, this paper proposes a feedback-free, model-free sensing-assisted dynamic beam management scheme. The method integrates real-time sensing echo data and angle-of-departure (AoD) estimation accuracy to adaptively switch between multi-beam and single-beam transmission—without requiring user feedback or prior knowledge of channel dynamics. Crucially, it is the first work to incorporate deep reinforcement learning (DRL) into a joint sensing-communication architecture for beam management. Performance is rigorously analyzed via angular discretization modeling and Cramér–Rao lower bound (CRLB)-driven evaluation. Results demonstrate significant throughput gains across diverse user velocities, achieving both high spectral efficiency and robustness to high-speed mobility—outperforming conventional beam sweeping and AoD-based heuristic approaches.

Technology Category

Application Category

📝 Abstract
Mobile users are prone to experience beam failure due to beam drifting in millimeter wave (mmWave) communications. Sensing can help alleviate beam drifting with timely beam changes and low overhead since it does not need user feedback. This work studies the problem of optimizing sensing-aided communication by dynamically managing beams allocated to mobile users. A multi-beam scheme is introduced, which allocates multiple beams to the users that need an update on the angle of departure (AoD) estimates and a single beam to the users that have satisfied AoD estimation precision. A deep reinforcement learning (DRL) assisted method is developed to optimize the beam allocation policy, relying only upon the sensing echoes. For comparison, a heuristic AoD-based method using approximated Cram'er-Rao lower bound (CRLB) for allocation is also presented. Both methods require neither user feedback nor prior state evolution information. Results show that the DRL-assisted method achieves a considerable gain in throughput than the conventional beam sweeping method and the AoD-based method, and it is robust to different user speeds.
Problem

Research questions and friction points this paper is trying to address.

Optimizing beam allocation in sensing-aided mmWave communications
Reducing beam failure via dynamic multi-beam management
Enhancing throughput using DRL without user feedback
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-beam scheme optimizes AoD estimation dynamically
DRL-assisted method enhances throughput without user feedback
Sensing echoes replace traditional beam sweeping overhead
🔎 Similar Papers
No similar papers found.
X
Xiyu Wang
Department of Electronic Systems, Aalborg University, Denmark
Gilberto Berardinelli
Gilberto Berardinelli
Aalborg University
Hei Victor Cheng
Hei Victor Cheng
Assistant Professor, Aarhus University
Communication TheorySignal ProcessingInformation TheoryB5G/6GMachine Learning
P
P. Popovski
Department of Electronic Systems, Aalborg University, Denmark
R
Ramoni O. Adeogun
Department of Electronic Systems, Aalborg University, Denmark