HetroD: A High-Fidelity Drone Dataset and Benchmark for Autonomous Driving in Heterogeneous Traffic

📅 2026-02-03

📈 Citations: 0

✨ Influential: 0

career value

230K/year

🤖 AI Summary

Existing autonomous driving datasets struggle to capture the complex, unstructured interactions of vulnerable road users—such as pedestrians and cyclists—in high-density heterogeneous traffic, including behaviors like hook turns and lane weaving. To address this gap, this work presents a large-scale, high-fidelity traffic dataset collected via drone-based aerial imagery, featuring centimeter-accurate trajectory annotations, high-definition maps, and synchronized traffic signal states. A modular toolchain is provided to support downstream tasks. The released dataset comprises over 65.4k trajectories, with 70% corresponding to vulnerable road users, and establishes the first standardized benchmark for such scenarios. Evaluations reveal a significant performance drop in current prediction and planning models, underscoring the need for novel approaches tailored to these challenging interactions.

Technology Category

Application Category

📝 Abstract

We present HetroD, a dataset and benchmark for developing autonomous driving systems in heterogeneous environments. HetroD targets the critical challenge of navi- gating real-world heterogeneous traffic dominated by vulner- able road users (VRUs), including pedestrians, cyclists, and motorcyclists that interact with vehicles. These mixed agent types exhibit complex behaviors such as hook turns, lane splitting, and informal right-of-way negotiation. Such behaviors pose significant challenges for autonomous vehicles but remain underrepresented in existing datasets focused on structured, lane-disciplined traffic. To bridge the gap, we collect a large- scale drone-based dataset to provide a holistic observation of traffic scenes with centimeter-accurate annotations, HD maps, and traffic signal states. We further develop a modular toolkit for extracting per-agent scenarios to support downstream task development. In total, the dataset comprises over 65.4k high- fidelity agent trajectories, 70% of which are from VRUs. HetroD supports modeling of VRU behaviors in dense, het- erogeneous traffic and provides standardized benchmarks for forecasting, planning, and simulation tasks. Evaluation results reveal that state-of-the-art prediction and planning models struggle with the challenges presented by our dataset: they fail to predict lateral VRU movements, cannot handle unstructured maneuvers, and exhibit limited performance in dense and multi-agent scenarios, highlighting the need for more robust approaches to heterogeneous traffic. See our project page for more examples: https://hetroddata.github.io/HetroD/

Problem

Research questions and friction points this paper is trying to address.

heterogeneous traffic

vulnerable road users

autonomous driving

unstructured behavior

trajectory prediction

Innovation

Methods, ideas, or system contributions that make the work stand out.

heterogeneous traffic

vulnerable road users (VRUs)

drone-based dataset