Active Scout: Multi-Target Tracking Using Neural Radiance Fields in Dense Urban Environments

📅 2024-06-11

🏛️ IEEE/RSJ International Conference on Intelligent Robots and Systems

📈 Citations: 1

✨ Influential: 0

🤖 AI Summary

In dense urban environments with tall buildings, severe occlusions frequently cause tracking of dynamic ground targets to fail. Method: This paper proposes an active tracking framework in which a quadrotor scout builds a neural radiance field (NeRF) of the city online from RGB and depth images taken at different vantage points. The NeRF is used to compute information gain, which drives both exploration of unmapped areas and target tracking. Experiments run in a custom simulator built from OpenStreetMap data of Philadelphia and New York City. Results: The scout locates 20 stationary targets within 300 steps, which is slower than a greedy baseline that does not use active perception. For dynamic targets that actively hide behind occlusions, the worst-case tracking error is 200 m, compared with up to 600 m for the greedy baseline. The scout periodically switches its attention between targets, and its tracking improves as the quality of the NeRF representation improves over time.

📝 Abstract

We study pursuit-evasion games in highly occluded urban environments, e.g., among tall buildings in a city, where a scout (quadrotor) tracks multiple dynamic targets on the ground. We show that we can build a neural radiance field (NeRF) representation of the city online, using RGB and depth images from different vantage points. This representation is used to calculate the information gain to both explore unknown parts of the city and track the targets, thereby giving a completely first-principles approach to actively tracking dynamic targets. We demonstrate, using a custom-built simulator based on OpenStreetMap data of Philadelphia and New York City, that we can explore and locate 20 stationary targets within 300 steps. This is slower than a greedy baseline, which does not use active perception. But for dynamic targets that actively hide behind occlusions, we show that our approach maintains, at worst, a tracking error of 200 m; the greedy baseline can have a tracking error as large as 600 m. We observe a number of interesting properties in the scout's policies, e.g., it periodically switches its attention to track a different target, and as the quality of the NeRF representation improves over time, the scout also becomes better at target tracking.
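
The abstract does not spell out how information gain is computed. As a point of reference only, the textbook definition is sketched below; the symbols X (the unknown state, e.g., map and target locations), a (a candidate scout action or viewpoint), and z (the observation it would produce) are assumptions introduced here, and the paper's exact objective may differ.

```latex
% Generic expected information gain of action a (illustrative notation, not taken from the paper)
\mathrm{IG}(a) \;=\; H(X) \;-\; \mathbb{E}_{z \sim p(z \mid a)}\bigl[\, H(X \mid z, a) \,\bigr]
\;=\; I(X;\, Z \mid a)
```

Under this definition, an action is valuable to the extent that the observation it yields is expected to reduce uncertainty about X, which covers both exploring unmapped regions and re-acquiring a hidden target.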

Problem

Research questions and friction points this paper is trying to address.

Track multiple dynamic targets in dense urban environments

Build online NeRF representations for exploration and tracking

Compare performance against a greedy baseline in occlusion-rich settings

Innovation

Methods, ideas, or system contributions that make the work stand out.

Builds a NeRF of the urban scene online from RGB and depth images

Calculates information gain for exploration and tracking

Demonstrates tracking of multiple dynamic targets that hide behind occlusions
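
To make the idea of information-gain-driven view selection concrete, here is a minimal sketch. It is not the paper's method: a 2D occupancy-probability grid stands in for the NeRF, the sensor footprint is a simple disc with no occlusion reasoning, and the names `binary_entropy`, `view_gain`, and `pick_next_view` are hypothetical. The score for each candidate viewpoint is the total entropy of the cells it would cover, which is an upper bound on what a single noiseless observation could remove.

```python
import numpy as np


def binary_entropy(p):
    """Entropy in bits of Bernoulli occupancy probabilities."""
    p = np.clip(p, 1e-9, 1 - 1e-9)  # avoid log(0)
    return -(p * np.log2(p) + (1 - p) * np.log2(1 - p))


def view_gain(occ_prob, viewpoint, radius):
    """Sum of cell entropies inside a disc around `viewpoint`.

    Assumption: the sensor sees every cell in the disc (no occlusion,
    no noise), so this is an optimistic proxy for information gain.
    """
    rows, cols = np.indices(occ_prob.shape)
    r, c = viewpoint
    mask = (rows - r) ** 2 + (cols - c) ** 2 <= radius ** 2
    return float(binary_entropy(occ_prob)[mask].sum())


def pick_next_view(occ_prob, candidates, radius):
    """Return the candidate with the highest proxy gain, plus all gains."""
    gains = [view_gain(occ_prob, v, radius) for v in candidates]
    return candidates[int(np.argmax(gains))], gains


# Toy map: the left half is already known to be mostly free,
# the right half is completely unknown (p = 0.5).
occ = np.full((20, 20), 0.5)
occ[:, :10] = 0.02
candidates = [(10, 3), (10, 16)]
best, gains = pick_next_view(occ, candidates, radius=3)
print(best, gains)
```

In this toy setup the viewpoint over the unknown half scores higher, so the scout would move there. A version closer to the setting described in the abstract would need to account for occlusion by buildings and add a term for uncertainty over target locations.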
