Deep Reinforcement Learning for Joint Time and Power Management in SWIPT-EH CIoT

📅 2025-12-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the joint optimization of time allocation and power control in simultaneous wireless information and power transfer (SWIPT)-enabled cognitive Internet of Things (CIoT) networks. Under practical constraints—including small-scale fading, realistic energy harvesting dynamics, and stringent interference limits—the problem is formulated as a Markov decision process (MDP). To enable end-to-end autonomous resource coordination, we propose a novel Double Deep Q-Network with Upper Confidence Bound (DDQN-UCB) algorithm—the first to integrate UCB-based exploration into DDQN for CIoT resource management. The method jointly maximizes throughput and energy efficiency, thereby significantly extending network lifetime. Simulation results demonstrate that, compared to state-of-the-art deep reinforcement learning approaches, the proposed algorithm improves system throughput by 18.7% and increases average node lifetime by 23.4%, while satisfying strict real-time and energy-efficiency requirements.

Technology Category

Application Category

📝 Abstract
This letter presents a novel deep reinforcement learning (DRL) approach for joint time allocation and power control in a cognitive Internet of Things (CIoT) system with simultaneous wireless information and power transfer (SWIPT). The CIoT transmitter autonomously manages energy harvesting (EH) and transmissions using a learnable time switching factor while optimizing power to enhance throughput and lifetime. The joint optimization is modeled as a Markov decision process under small-scale fading, realistic EH, and interference constraints. We develop a double deep Q-network (DDQN) enhanced with an upper confidence bound. Simulations benchmark our approach, showing superior performance over existing DRL methods.
Problem

Research questions and friction points this paper is trying to address.

Optimizes joint time and power management in SWIPT-EH CIoT systems
Enhances throughput and lifetime under fading and interference constraints
Develops a DDQN-based DRL approach for autonomous energy and transmission control
Innovation

Methods, ideas, or system contributions that make the work stand out.

DDQN with confidence bound for joint optimization
Learnable time switching factor for energy management
Markov decision process under fading and constraints
🔎 Similar Papers
No similar papers found.
N
Nadia Abdolkhani
Department of Electrical and Computer Engineering, Concordia University, Montreal, QC H3G 1M8, Canada
N
Nada Abdel Khalek
Department of Electrical and Computer Engineering, Concordia University, Montreal, QC H3G 1M8, Canada
Walaa Hamouda
Walaa Hamouda
Professor, Concordia Research Chair in AI-Enabled 5G and Beyond Wireless Communications, ECE Dept.
5G/6GAI for wireless communicationsmm-wave communicationsmassive-MIMOIoT/M2M
Iyad Dayoub
Iyad Dayoub
Université Polytecnique Hauts-de-France, 59313 Valenciennes, France