FlowDrive: Energy Flow Field for End-to-End Autonomous Driving

📅 2025-09-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing end-to-end autonomous driving methods rely on implicit bird’s-eye-view (BEV) features and lack explicit modeling of risk and guidance priors, resulting in insufficient planning safety and interpretability. To address this, we propose the Energy Flow Field (EFF) framework: it explicitly constructs a risk potential field and a lane-attracting field in BEV space, endowing trajectory planning with physical interpretability; introduces a feature-level gating mechanism to decouple motion intent prediction from trajectory denoising, enhancing multimodal planning diversity and robustness; and integrates BEV semantic prior encoding with a conditional diffusion-based planner for high-fidelity trajectory generation. Evaluated on the NAVSIM v2 benchmark, EFF achieves 86.3 EPDMS—significantly outperforming prior methods—while simultaneously ensuring safety, interpretability, and planning quality.

Technology Category

Application Category

📝 Abstract
Recent advances in end-to-end autonomous driving leverage multi-view images to construct BEV representations for motion planning. In motion planning, autonomous vehicles need considering both hard constraints imposed by geometrically occupied obstacles (e.g., vehicles, pedestrians) and soft, rule-based semantics with no explicit geometry (e.g., lane boundaries, traffic priors). However, existing end-to-end frameworks typically rely on BEV features learned in an implicit manner, lacking explicit modeling of risk and guidance priors for safe and interpretable planning. To address this, we propose FlowDrive, a novel framework that introduces physically interpretable energy-based flow fields-including risk potential and lane attraction fields-to encode semantic priors and safety cues into the BEV space. These flow-aware features enable adaptive refinement of anchor trajectories and serve as interpretable guidance for trajectory generation. Moreover, FlowDrive decouples motion intent prediction from trajectory denoising via a conditional diffusion planner with feature-level gating, alleviating task interference and enhancing multimodal diversity. Experiments on the NAVSIM v2 benchmark demonstrate that FlowDrive achieves state-of-the-art performance with an EPDMS of 86.3, surpassing prior baselines in both safety and planning quality. The project is available at https://astrixdrive.github.io/FlowDrive.github.io/.
Problem

Research questions and friction points this paper is trying to address.

Modeling risk and guidance priors for safe autonomous driving
Integrating semantic and safety cues into BEV representations
Decoupling motion intent prediction from trajectory denoising
Innovation

Methods, ideas, or system contributions that make the work stand out.

Energy-based flow fields for semantic priors
Conditional diffusion planner with gating
Adaptive refinement of anchor trajectories
🔎 Similar Papers
No similar papers found.
H
Hao Jiang
Shanghai Jiao Tong University
Zhipeng Zhang
Zhipeng Zhang
School of Artificial Intelligence, Shanghai Jiao Tong University
Computer Vision,Object Tracking and Segmentation
Y
Yu Gao
Bosch Corporate Research, Shanghai, China
Z
Zhigang Sun
Bosch Corporate Research, Shanghai, China
Yiru Wang
Yiru Wang
University of Pittsburgh
Econometrics
Y
Yuwen Heng
Bosch Corporate Research, Shanghai, China
S
Shuo Wang
Bosch Corporate Research, Shanghai, China
J
Jinhao Chai
Shanghai University, Shanghai, China
Z
Zhuo Chen
Shanghai Jiao Tong University
H
Hao Zhao
AIR, Tsinghua University
H
Hao Sun
Bosch Corporate Research, Shanghai, China
X
Xi Zhang
Shanghai Jiao Tong University
A
Anqing Jiang
Bosch Corporate Research, Shanghai, China
Chuan Hu
Chuan Hu
Associate Professor of Mechanical Engineering, Shanghai Jiao Tong University
Autonomous DrivingDecision and PlanningHMIHuman-AI CollaborationHiL RL