Challenger: Affordable Adversarial Driving Video Generation

📅 2025-05-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing adversarial driving video generation methods predominantly rely on abstract trajectories or bird’s-eye-view (BEV) representations, lacking realistic sensor data and thus failing to effectively stress-test autonomous driving systems. To address this, we propose the first framework jointly optimizing for physical plausibility and high-fidelity sensor observations. Our method introduces a multi-round physics-aware trajectory refinement mechanism and a transferability-driven trajectory scoring function, integrating vehicle dynamics modeling, multi-agent interaction optimization, BEV-to-video cross-modal synthesis, and multi-view rendering. Evaluated on nuScenes, our framework successfully generates diverse adversarial scenarios—including cut-in and blind-spot intrusion—with physically coherent motion and photorealistic sensor outputs. It significantly increases collision rates of end-to-end models such as UniAD under black-box evaluation, while demonstrating strong cross-model transferability across diverse autonomous driving architectures.

Technology Category

Application Category

📝 Abstract
Generating photorealistic driving videos has seen significant progress recently, but current methods largely focus on ordinary, non-adversarial scenarios. Meanwhile, efforts to generate adversarial driving scenarios often operate on abstract trajectory or BEV representations, falling short of delivering realistic sensor data that can truly stress-test autonomous driving (AD) systems. In this work, we introduce Challenger, a framework that produces physically plausible yet photorealistic adversarial driving videos. Generating such videos poses a fundamental challenge: it requires jointly optimizing over the space of traffic interactions and high-fidelity sensor observations. Challenger makes this affordable through two techniques: (1) a physics-aware multi-round trajectory refinement process that narrows down candidate adversarial maneuvers, and (2) a tailored trajectory scoring function that encourages realistic yet adversarial behavior while maintaining compatibility with downstream video synthesis. As tested on the nuScenes dataset, Challenger generates a diverse range of aggressive driving scenarios-including cut-ins, sudden lane changes, tailgating, and blind spot intrusions-and renders them into multiview photorealistic videos. Extensive evaluations show that these scenarios significantly increase the collision rate of state-of-the-art end-to-end AD models (UniAD, VAD, SparseDrive, and DiffusionDrive), and importantly, adversarial behaviors discovered for one model often transfer to others.
Problem

Research questions and friction points this paper is trying to address.

Generating affordable adversarial driving videos for stress-testing autonomous systems
Optimizing traffic interactions and high-fidelity sensor observations jointly
Enhancing collision rates in state-of-the-art AD models via diverse scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Physics-aware multi-round trajectory refinement process
Tailored trajectory scoring function for adversarial behavior
Multiview photorealistic video rendering of aggressive scenarios
🔎 Similar Papers
No similar papers found.
Z
Zhiyuan Xu
AIR, Tsinghua, UCAS
B
Bohan Li
AIR, Tsinghua, SJTU
Huan-ang Gao
Huan-ang Gao
Ph.D. student, Tsinghua University
AgentVision & Robotics
Mingju Gao
Mingju Gao
Unknown affiliation
Computer VisionRobotics
Y
Yong Chen
Geely Auto
M
Ming Liu
Geely Auto
C
Chenxu Yan
Geely Auto
H
Hang Zhao
IIIS, Tsinghua
S
Shuo Feng
DA, Tsinghua
H
Hao Zhao
AIR, Tsinghua