CSAOT: Cooperative Multi-Agent System for Active Object Tracking

📅 2025-01-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the poor robustness of single-camera active object tracking (AOT) in complex dynamic scenes—particularly under occlusion and rapid target motion—this paper proposes a single-device, multi-agent collaborative framework. We design a lightweight multi-agent deep reinforcement learning model based on a Mixture-of-Experts (MoE) architecture, enabling role specialization and cooperative policy learning to jointly control camera viewpoints on a single hardware platform—without requiring auxiliary cameras. Our approach integrates dynamic environment modeling with simulation-based training and is evaluated across diverse maps containing both static and dynamic obstacles. Experimental results demonstrate a 37.2% increase in average tracking duration, a 2.1× improvement in occlusion recovery speed, and a 91.4% tracking success rate—significantly outperforming both single-agent baselines and external multi-camera approaches.

Technology Category

Application Category

📝 Abstract
Object Tracking is essential for many computer vision applications, such as autonomous navigation, surveillance, and robotics. Unlike Passive Object Tracking (POT), which relies on static camera viewpoints to detect and track objects across consecutive frames, Active Object Tracking (AOT) requires a controller agent to actively adjust its viewpoint to maintain visual contact with a moving target in complex environments. Existing AOT solutions are predominantly single-agent-based, which struggle in dynamic and complex scenarios due to limited information gathering and processing capabilities, often resulting in suboptimal decision-making. Alleviating these limitations necessitates the development of a multi-agent system where different agents perform distinct roles and collaborate to enhance learning and robustness in dynamic and complex environments. Although some multi-agent approaches exist for AOT, they typically rely on external auxiliary agents, which require additional devices, making them costly. In contrast, we introduce the Collaborative System for Active Object Tracking (CSAOT), a method that leverages multi-agent deep reinforcement learning (MADRL) and a Mixture of Experts (MoE) framework to enable multiple agents to operate on a single device, thereby improving tracking performance and reducing costs. Our approach enhances robustness against occlusions and rapid motion while optimizing camera movements to extend tracking duration. We validated the effectiveness of CSAOT on various interactive maps with dynamic and stationary obstacles.
Problem

Research questions and friction points this paper is trying to address.

Active Object Tracking
Multi-Camera Systems
Cost-Efficiency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-camera Active Object Tracking
Multi-agent Deep Reinforcement Learning
Hybrid Expert Models
🔎 Similar Papers
No similar papers found.
H
Hy Nguyen
Applied Artificial Intelligence Institute, Deakin University, Melbourne, Australia
B
Bao Pham
Applied Artificial Intelligence Institute, Deakin University, Melbourne, Australia
Hung Du
Hung Du
Applied Artificial Intelligence Institute - Deakin University
Deep Reinforcement LearningMulti-agent SystemsContext-aware SystemsTranslational Research
S
Srikanth Thudumu
Applied Artificial Intelligence Institute, Deakin University, Melbourne, Australia
Rajesh Vasa
Rajesh Vasa
Head of Translational Research, Applied Artificial Intelligence Institute, Deakin University
Artificial IntelligenceSoftware EvolutionAutomated Software EngineeringTools
K
K. Mouzakis
Applied Artificial Intelligence Institute, Deakin University, Melbourne, Australia