HiRL: Hierarchical Reinforcement Learning for Coordinated Resource Management in Heterogeneous Edge Computing

📅 2026-05-11
📈 Citations: 0
Influential: 0
📄 PDF

career value

237K/year
🤖 AI Summary
This work addresses the challenge of real-time resource orchestration in heterogeneous edge computing, where multidimensional resource heterogeneity, diverse task requirements, and dynamic network conditions complicate efficient scheduling. To tackle this, the authors propose HiRL, a hierarchical reinforcement learning framework that decouples the mixed continuous-discrete optimization problem into two coordinated levels: power control via Twin Delayed Deep Deterministic Policy Gradient (TD3) and task assignment using Double Deep Q-Networks (DDQN). Decision coordination is achieved through a five-dimensional queue state representation. The approach innovatively integrates continuous and discrete reinforcement learning with deadline-aware priority scheduling, adaptive sampling based on failure penalties, and a compatibility-aware evaluation mechanism for heterogeneous resources. Experiments demonstrate that HiRL reduces latency by 28% and achieves near-100% task completion rates compared to Single-DDQN, cuts energy consumption by up to 51% under low load, and improves delay by 24% over static methods under high load, significantly enhancing the trade-off between energy efficiency and latency.
📝 Abstract
Edge computing faces unprecedented resource orchestration challenges from multi-dimensional heterogeneity across device architectures, diverse task requirements in CPU-intensive, GPU-intensive, I/O-intensive, and dynamic network conditions. The edge environments demand real-time task processing within strict energy budgets, yet conventional approaches struggle with mixed continuous-discrete optimization while meeting deadline and energy constraints. This paper presents HiRL, a hierarchical reinforcement learning framework that decomposes complex resource orchestration into coordinated power control and task allocation decisions. Our approach separates continuous power management using the Twin Delayed Deep Deterministic Policy Gradient (TD3) and discrete task placement using Double Deep Q-Network (DDQN), unified through a coordination engine with five-dimensional queue state representation. We propose a heterogeneous assessment of resource compatibility with deadline-oriented prioritization and failure-penalized adaptive sampling to enhance decision quality under resource constraints. To improve practical applicability, the framework models comprehensive system dynamics including device mobility, queue congestion patterns, infrastructure heterogeneity, and priority-sensitive scheduling demands. Experimental results show that HiRL achieves effective latency-energy trade-offs with 28% latency reduction compared to Single-DDQN and maintains nearly 100% task completion rates under all load conditions. Compared to baseline algorithms, HiRL reduces energy consumption by up to 51% under low load while achieving 24% better latency performance than static optimization approaches under high load, establishing effective resource orchestration in heterogeneous edge environments.
Problem

Research questions and friction points this paper is trying to address.

heterogeneous edge computing
resource orchestration
deadline constraints
energy efficiency
mixed continuous-discrete optimization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical Reinforcement Learning
Resource Orchestration
Edge Computing
Mixed Continuous-Discrete Optimization
Deadline-Aware Scheduling
🔎 Similar Papers
No similar papers found.
J
Jianyong Zhu
Yanzhao Electric Power Laboratory of North China Electric Power University, and Hebei Key Laboratory of Knowledge Computing for Energy & Power, Baoding 071000, China
H
Hao Chen
Department of Computer, North China Electric Power University, and Engineering Research Center of Intelligent Computing for Complex Energy Systems, Ministry of Education, Baoding 071000, China
Juan Zhang
Juan Zhang
Department of Mathematics, Xiangtan University
Matrix ComputationNumerical AlgebraNumerical Algorithm
F
Fangda Guo
State Key Laboratory of AI Safety, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
A
Albert Y. Zomaya
University of Sydney, Australia
Renyu Yang
Renyu Yang
Associate Professor, Beihang University; formerly, University of Leeds
Parallel and Distributed ComputingResource ManagementDeep Learning SystemsAnomaly Detection