🤖 AI Summary
Classic reinforcement learning (RL) suffers from poor generalization, low sample efficiency, and limited interpretability in dynamic wireless networks. To address these limitations, this paper proposes a large language model (LLM)-enhanced RL paradigm. We introduce a novel four-role taxonomy for LLMs in RL—state perception, reward shaping, decision making, and environment generation—and establish a rigorous theoretical framework and methodology. By integrating contextual reasoning, knowledge transfer, and interactive generation, our approach significantly improves policy generalization and decision transparency. We validate the framework across representative dynamic wireless scenarios: low-altitude economy networks, vehicular networks, and integrated space-air-ground networks. Furthermore, we release a comprehensive open-source tutorial and case library. This work provides a scalable, interpretable, and sample-efficient technical pathway toward AI-native intelligent network optimization.
📝 Abstract
Reinforcement Learning (RL) has shown remarkable success in enabling adaptive and data-driven optimization for various applications in wireless networks. However, classical RL suffers from limitations in generalization, learning feedback, interpretability, and sample efficiency in dynamic wireless environments. Large Language Models (LLMs) have emerged as a transformative Artificial Intelligence (AI) paradigm with exceptional capabilities in knowledge generalization, contextual reasoning, and interactive generation, which have demonstrated strong potential to enhance classical RL. This paper serves as a comprehensive tutorial on LLM-enhanced RL for wireless networks. We propose a taxonomy to categorize the roles of LLMs into four critical functions: state perceiver, reward designer, decision-maker, and generator. Then, we review existing studies exploring how each role of LLMs enhances different stages of the RL pipeline. Moreover, we provide a series of case studies to illustrate how to design and apply LLM-enhanced RL in low-altitude economy networking, vehicular networks, and space-air-ground integrated networks. Finally, we conclude with a discussion on potential future directions for LLM-enhanced RL and offer insights into its future development in wireless networks.