Controllable Traffic Simulation through LLM-Guided Hierarchical Chain-of-Thought Reasoning

📅 2024-09-23
🏛️ arXiv.org
📈 Citations: 4
Influential: 0
📄 PDF
🤖 AI Summary
Existing traffic simulation methods exhibit weak responsiveness to complex semantic instructions and poor controllability, hindering safety validation of autonomous driving systems across diverse scenarios. To address this, we propose the first LLM-guided hierarchical chain-of-thought mechanism, integrated with a geometry-aware cost function formulated in the Frenet coordinate system, enabling joint controllable generation of semantic descriptions and motion geometry. Our method synergistically combines diffusion models, large language models, hierarchical reasoning, and Frenet-based motion modeling. Evaluated on the Waymo Open Motion Dataset, it achieves a 23% improvement in generation efficiency and a 31% increase in semantic–geometric alignment accuracy over state-of-the-art diffusion-based approaches. The framework establishes a novel simulation paradigm for autonomous driving validation—characterized by high fidelity, interpretability, and strong controllability.

Technology Category

Application Category

📝 Abstract
Evaluating autonomous driving systems in complex and diverse traffic scenarios through controllable simulation is essential to ensure their safety and reliability. However, existing traffic simulation methods face challenges in their controllability. To address this, this paper proposes a novel diffusion-based and LLM-enhanced traffic simulation framework. Our approach incorporates a unique chain-of-thought (CoT) mechanism, which systematically examines the hierarchical structure of traffic elements and guides LLMs to thoroughly analyze traffic scenario descriptions step by step, enhancing their understanding of complex situations. Furthermore, we propose a Frenet-frame-based cost function framework that provides LLMs with geometrically meaningful quantities, improving their grasp of spatial relationships in a scenario and enabling more accurate cost function generation. Experiments on the Waymo Open Motion Dataset (WOMD) demonstrate that our method handles more intricate descriptions, generates a broader range of scenarios in a controllable manner, and outperforms existing diffusion-based methods in terms of efficiency.
Problem

Research questions and friction points this paper is trying to address.

Enhancing controllability in traffic simulation for autonomous driving evaluation
Improving LLM understanding of complex traffic scenarios through hierarchical reasoning
Generating accurate cost functions for better spatial relationship modeling
Innovation

Methods, ideas, or system contributions that make the work stand out.

Diffusion-based and LLM-enhanced traffic simulation framework
Hierarchical high-level understanding and low-level refinement modules
Frenet-frame-based cost function for spatial relationships
🔎 Similar Papers
No similar papers found.
Z
Zhiyuan Liu
School of Vehicle and Mobility, Tsinghua University
Leheng Li
Leheng Li
HKUST(GZ)
Computer Vision
Y
Yuning Wang
School of Vehicle and Mobility, Tsinghua University
Haotian Lin
Haotian Lin
Carnegie Mellon University
RoboticsAutonomous DrivingReinforcement Learning
Z
Zhizhe Liu
Department of Computer Sciences, University of Wisconsin-Madison
L
Lei He
School of Vehicle and Mobility, Tsinghua University
Jianqiang Wang
Jianqiang Wang
Associate Professor of Library and Information Studies, University at Buffalo
Information Retrievale-discovery