Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management

📅 2025-06-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the time-allocation problem in multifunctional cognitive radar systems operating in dynamic environments, where simultaneous optimization of new-target search and known-target tracking is required. We propose a Pareto-optimal scheduling framework based on multi-objective deep reinforcement learning (MO-DRL). Methodologically, we integrate NSGA-II to estimate the upper bound of the Pareto front for modeling multi-objective trade-offs, and employ both DDPG and SAC to learn adaptive time-allocation policies. Experimental results demonstrate that the proposed framework significantly enhances environmental adaptability; SAC outperforms DDPG in policy stability and sample efficiency, validating the effectiveness and advancement of MO-DRL for radar resource scheduling. This study establishes a scalable optimization paradigm for intelligent temporal decision-making in cognitive radar systems.

Technology Category

Application Category

📝 Abstract
The time allocation problem in multi-function cognitive radar systems focuses on the trade-off between scanning for newly emerging targets and tracking the previously detected targets. We formulate this as a multi-objective optimization problem and employ deep reinforcement learning to find Pareto-optimal solutions and compare deep deterministic policy gradient (DDPG) and soft actor-critic (SAC) algorithms. Our results demonstrate the effectiveness of both algorithms in adapting to various scenarios, with SAC showing improved stability and sample efficiency compared to DDPG. We further employ the NSGA-II algorithm to estimate an upper bound on the Pareto front of the considered problem. This work contributes to the development of more efficient and adaptive cognitive radar systems capable of balancing multiple competing objectives in dynamic environments.
Problem

Research questions and friction points this paper is trying to address.

Balancing scanning and tracking in cognitive radar systems
Finding Pareto-optimal solutions via multi-objective reinforcement learning
Comparing DDPG and SAC algorithms for radar resource management
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-objective deep reinforcement learning for radar
Compare DDPG and SAC algorithms effectively
NSGA-II estimates upper bound Pareto front
🔎 Similar Papers
No similar papers found.
Ziyang Lu
Ziyang Lu
University at Buffalo
Wireless communicationMachine learningResource Allocation
Subodh Kalia
Subodh Kalia
Syracuse University
OptimizationFinite Element Methods
M. Cenk Gursoy
M. Cenk Gursoy
Electrical Engineering and Computer Science, Syracuse University
Wireless CommunicationsMachine LearningInformation TheorySignal ProcessingWireless Networks
C
Chilukuri K. Mohan
Department of Electrical Engineering and Computer Science, Syracuse University, Syracuse NY, 13066
P
Pramod K. Varshney
Department of Electrical Engineering and Computer Science, Syracuse University, Syracuse NY, 13066