Multi-User MISO with Stacked Intelligent Metasurfaces: A DRL-Based Sum-Rate Optimization Approach

📅 2024-08-09
🏛️ arXiv.org
📈 Citations: 6
Influential: 0
📄 PDF
🤖 AI Summary
In multi-user multiple-input single-output (MISO) systems, limited radio-frequency (RF) chains lead to high computational complexity and low energy efficiency in conventional precoding. Method: This paper proposes a synergistic architecture integrating stacked intelligent metasurfaces (SIM) and deep reinforcement learning (DRL). It introduces multi-layer SIMs into MISO systems for the first time, constructing a parameterized wireless environment state space and jointly optimizing SIM phase responses and base station transmit power allocation. Proximal Policy Optimization (PPO) is employed with pre-designed states, data whitening, and hyperparameter tuning to enable lightweight, real-time control under stringent RF-chain and transmit-power constraints. Contribution/Results: Experiments demonstrate a 32% improvement in sum rate over conventional precoding, stable training convergence, and strong generalization capability—validating the effectiveness of SIM-DRL co-optimization in reducing hardware cost while enhancing spectral efficiency.

Technology Category

Application Category

📝 Abstract
Stacked intelligent metasurfaces (SIMs) represent a novel signal processing paradigm that enables over-the-air processing of electromagnetic waves at the speed of light. Their multi-layer architecture exhibits customizable computational capabilities compared to conventional single-layer reconfigurable intelligent surfaces and metasurface lenses. In this paper, we deploy SIM to improve the performance of multi-user multiple-input single-output (MISO) wireless systems through a low complexity manner with reduced numbers of transmit radio frequency chains. In particular, an optimization formulation for the joint design of the SIM phase shifts and the transmit power allocation is presented, which is efficiently tackled via a customized deep reinforcement learning (DRL) approach that systematically explores pre-designed states of the SIM-parametrized smart wireless environment. The presented performance evaluation results demonstrate the proposed method's capability to effectively learn from the wireless environment, while consistently outperforming conventional precoding schemes under low transmit power conditions. Furthermore, the implementation of hyperparameter tuning and whitening process significantly enhance the robustness of the proposed DRL framework.
Problem

Research questions and friction points this paper is trying to address.

Optimizing SIM phase shifts and power allocation for MISO systems
Enhancing wireless performance with low complexity and fewer RF chains
Using DRL to learn and outperform conventional precoding schemes
Innovation

Methods, ideas, or system contributions that make the work stand out.

Stacked intelligent metasurfaces enable light-speed signal processing
DRL optimizes SIM phase shifts and power allocation
Hyperparameter tuning enhances DRL framework robustness
🔎 Similar Papers
No similar papers found.
H
Hao Liu
School of Information and Communication Engineering, University of Electronic Science and Technology of China (UESTC), Chengdu, Sichuan 611731, China
Jiancheng An
Jiancheng An
Nanyang Technological University
Stacked Intelligent MetasurfaceFlexible Intelligent MetasurfaceSIMFIM
G
G. C. Alexandropoulos
Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, 15784 Athens, Greece
Derrick Wing Kwan Ng
Derrick Wing Kwan Ng
Scientia Associate Professor, University of New South Wales
Wireless Communications
Chau Yuen
Chau Yuen
IEEE Fellow, Highly Cited Researcher, Nanyang Technological University
WirelessSmart GridLocalizationIoTBig Data
L
Lu Gan
School of Information and Communication Engineering, University of Electronic Science and Technology of China (UESTC), Chengdu, Sichuan 611731, China; School of Information and Communication Engineering, the Yibin Institute of UESTC, Yibin, Sichuan 644000, China