Agentic AI Reasoning for Mobile Edge General Intelligence: Fundamentals, Approaches, and Directions

📅 2025-09-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In mobile edge general intelligence (MEGI) environments, autonomous LLM inference faces a fundamental tension between high computational overhead and severely constrained edge-device resources. Method: We propose a joint optimization framework integrating adaptive chain-of-thought (CoT) prompting with a distributed mixture-of-experts (MoE) architecture to enable dynamic inference—adjusting reasoning depth and expert activation count in real time based on task complexity and device capability—combined with supervised fine-tuning and lightweight dynamic resource scheduling. Contribution/Results: Experiments demonstrate that our framework significantly improves inference efficiency (2.3× speedup) and deployment scalability on resource-constrained edge devices, while preserving privacy and real-time responsiveness. To the best of our knowledge, this is the first work to achieve practical, high-quality autonomous LLM inference in MEGI scenarios.

Technology Category

Application Category

📝 Abstract
The rapid advancement of large language models (LLMs) has enabled an emergence of agentic artificial intelligence (AI) with powerful reasoning and autonomous decision-making capabilities. This integration with edge computing has led to the development of Mobile Edge General Intelligence (MEGI), which brings real-time, privacy-preserving reasoning to the network edge. However, deploying LLM-based agentic AI reasoning in MEGI environments poses significant challenges due to the high computational demands of reasoning and the limited resources of edge devices. To address these challenges, we propose a joint optimization framework for efficient LLM reasoning deployment in MEGI. First, we review methods that enhance LLM reasoning capabilities, such as Chain-of-Thought (CoT) prompting, Supervised Fine-Tuning (SFT), and Mixture of Experts (MoE). Next, we present a distributed framework that addresses two correlated aspects: reasoning enhancement through adaptive CoT prompting and scalable deployment through distributed MoE architecture. The framework dynamically activates expert networks and adjusts reasoning depth based on task complexity and device capabilities. We further conduct experimental evaluations in mobile edge environments. Experimental results demonstrate the framework's effectiveness in balancing reasoning quality with resource efficiency, validating the practical viability of deploying sophisticated LLM reasoning capabilities in resource-constrained MEGI environments.
Problem

Research questions and friction points this paper is trying to address.

Deploying LLM reasoning in resource-constrained edge environments
Balancing computational demands with limited edge device resources
Optimizing reasoning quality and efficiency for mobile edge intelligence
Innovation

Methods, ideas, or system contributions that make the work stand out.

Distributed framework for adaptive reasoning depth adjustment
Mixture of Experts architecture enabling scalable edge deployment
Dynamic expert network activation based on task complexity
🔎 Similar Papers
No similar papers found.
M
Mingyi Luo
Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen
Ruichen Zhang
Ruichen Zhang
Nanyang Technological University
Next-generation NetworkingEdge IntelligenceAgentic AIReinforcement learningLLM
Xiangwang Hou
Xiangwang Hou
Department of EE, Tsinghua University
Wireless Federated LearningEdge IntelligenceUAV/AUV Swarm
J
Jun Du
Department of Electronic Engineering and also with the State Key Laboratory of Space Network and Communications, Tsinghua University, Beijing 100084, China
C
Chunxiao Jiang
Beijing National Research Center for Information Science and Technology, and the State Key Laboratory of Space Network and Communications, Tsinghua University, Beijing 100084, China
Yong Ren
Yong Ren
Institute of Automation, Chinese Academy of Sciences
Speech CodecText-to-speechVideo-to-audioMLLMContinual Learning
D
Dusit Niyato
College of Computing and Data Science, Nanyang Technological University, Singapore
Shiwen Mao
Shiwen Mao
Professor and Earle C. Williams Eminent Scholar, Fellow of the IEEE, Dept. ECE, Auburn University
Wireless networkingmultimedia communicationsindoor localizationsmart healthsmart grid