Vehicle Dynamics Embedded World Models for Autonomous Driving

๐Ÿ“… 2025-12-02
๐Ÿ›๏ธ IEEE transactions on intelligent transportation systems (Print)
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Current driving world models jointly model ego-vehicle dynamics and environmental dynamics, resulting in poor cross-vehicle generalization and low robustness to dynamical variations. To address this, we propose VDD, a decoupled driving world model that explicitly embeds and isolates ego-dynamics modeling within the world modelโ€”achieving principled decoupling from environmental dynamics for the first time. VDD introduces a dynamics-aware state decomposition mechanism and a two-stage optimization framework: Policy Adaptation at Deployment (PAD) and Policy Augmentation during Training (PAT). The method integrates latent-variable modeling, sequential decision learning, and dynamics-constrained policy gradient optimization. Experiments demonstrate that VDD significantly improves driving performance and robustness to vehicle parameter variations in simulation, outperforming existing state-of-the-art methods in cross-vehicle generalization.

Technology Category

Application Category

๐Ÿ“ Abstract
World models have gained significant attention as a promising approach for autonomous driving. By emulating human-like perception and decision-making processes, these models can predict and adapt to dynamic environments. Existing methods typically map high-dimensional observations into compact latent spaces and learn optimal policies within these latent representations. However, prior work usually jointly learns ego-vehicle dynamics and environmental transition dynamics from the image input, leading to inefficiencies and a lack of robustness to variations in vehicle dynamics. To address these issues, we propose the Vehicle Dynamics embedded Dreamer (VDD) method, which decouples the modeling of ego-vehicle dynamics from environmental transition dynamics. This separation allows the world model to generalize effectively across vehicles with diverse parameters. Additionally, we introduce two strategies to further enhance the robustness of the learned policy: Policy Adjustment during Deployment (PAD) and Policy Augmentation during Training (PAT). Comprehensive experiments in simulated environments demonstrate that the proposed model significantly improves both driving performance and robustness to variations in vehicle dynamics, outperforming existing approaches.
Problem

Research questions and friction points this paper is trying to address.

Decouples ego-vehicle dynamics from environmental transitions
Enhances generalization across vehicles with diverse parameters
Improves driving performance and robustness to dynamics variations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Decouples ego-vehicle dynamics from environmental dynamics
Introduces Policy Adjustment during Deployment strategy
Uses Policy Augmentation during Training for robustness
H
Huiqian Li
School of Vehicle and Mobility, Tsinghua University, Beijing 100084, China
W
Wei Pan
Department of Computer Science, The University of Manchester, M13 9PL Manchester, U.K.
Haodong Zhang
Haodong Zhang
Control Science and Engineering, Zhejiang University
RoboticsAI
J
Jin Huang
School of Vehicle and Mobility, Tsinghua University, Beijing 100084, China
Zhihua Zhong
Zhihua Zhong
Tsinghua University, Chinese Academy of Engineering, Beijing 100088, China