Adaptive AI Agent Placement and Migration in Edge Intelligence Systems

📅 2025-08-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Deploying and migrating LLM-driven AI agents in edge intelligence systems faces critical challenges—including resource constraints, environmental heterogeneity, and stringent real-time requirements—particularly in dynamic edge environments. Method: This paper proposes the first adaptive deployment and migration framework tailored for such scenarios. It jointly models resource constraints and end-to-end latency costs, introduces a lightweight state migration mechanism, and innovatively integrates ant colony optimization with large language models for collaborative, distributed decision-making to enable efficient agent placement and dynamic rescheduling. Contribution/Results: Implemented as a prototype on AgentScope and empirically evaluated across globally distributed edge servers, the framework significantly reduces deployment latency and migration overhead. Under multimodal workloads, it improves resource utilization by 32.7% and ensures QoS compliance for 99.2% of tasks.

Technology Category

Application Category

📝 Abstract
The rise of LLMs such as ChatGPT and Claude fuels the need for AI agents capable of real-time task handling. However, migrating data-intensive, multi-modal edge workloads to cloud data centers, traditionally used for agent deployment, introduces significant latency. Deploying AI agents at the edge improves efficiency and reduces latency. However, edge environments present challenges due to limited and heterogeneous resources. Maintaining QoS for mobile users necessitates agent migration, which is complicated by the complexity of AI agents coordinating LLMs, task planning, memory, and external tools. This paper presents the first systematic deployment and management solution for LLM-based AI agents in dynamic edge environments. We propose a novel adaptive framework for AI agent placement and migration in edge intelligence systems. Our approach models resource constraints and latency/cost, leveraging ant colony algorithms and LLM-based optimization for efficient decision-making. It autonomously places agents to optimize resource utilization and QoS and enables lightweight agent migration by transferring only essential state. Implemented on a distributed system using AgentScope and validated across globally distributed edge servers, our solution significantly reduces deployment latency and migration costs.
Problem

Research questions and friction points this paper is trying to address.

Optimizing AI agent placement in edge systems to reduce latency
Managing resource constraints in heterogeneous edge environments
Enabling efficient migration of complex AI agents for QoS
Innovation

Methods, ideas, or system contributions that make the work stand out.

Adaptive AI agent placement in edge systems
Ant colony algorithms for resource optimization
Lightweight migration with essential state transfer
🔎 Similar Papers
No similar papers found.
X
Xingdan Wang
Institute of Artificial Intelligence and Future Networks, Beijing Normal University, China
J
Jiayi He
Faculty of Arts and Sciences, Beijing Normal University, China
Zhiqing Tang
Zhiqing Tang
Associate Professor, Beijing Normal University
Edge ComputingEdge AI SystemsContainerReinforcement Learning
Jianxiong Guo
Jianxiong Guo
Associate Professor of Computer Science, Beijing Normal University
IoT/Edge IntelligenceOnline/Federated LearningSocial ComputingCombinatorial Optimization
Jiong Lou
Jiong Lou
Research Assistant Professor, Shanghai Jiao Tong University
Edge computingBlockchain
L
Liping Qian
College of Information Engineering, Zhejiang University of Technology, China
T
Tian Wang
Institute of Artificial Intelligence and Future Networks, Beijing Normal University, China
Weijia Jia
Weijia Jia
FIEEE, Chair Professor, Beijing Normal University and UIC
Cyber Intelligent ComputingNetworking