Generative AI for Autonomous Driving: Frontiers and Opportunities

📅 2025-05-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper systematically investigates the key pathways and core challenges in leveraging generative AI (GenAI) for L5-level fully autonomous driving. Addressing critical bottlenecks—including poor generalization to long-tail scenarios, weak safety verifiability, and difficulty in cross-modal coordination—the authors propose the first comprehensive mapping framework linking GenAI capabilities with the full autonomous driving stack: multimodal perception generation, large language model (LLM)-driven decision-making, synthetic data construction, occupancy prediction, end-to-end policy learning, and digital twin simulation. A three-dimensional research roadmap is introduced, jointly optimizing technical robustness, safety verifiability, and socio-technical impact. The work establishes GenAI4AD, an open-source literature repository, and delivers a unified technology map spanning theory, algorithms, systems, and governance—providing both a methodological foundation and authoritative benchmark for reliable L5 autonomy.

Technology Category

Application Category

📝 Abstract
Generative Artificial Intelligence (GenAI) constitutes a transformative technological wave that reconfigures industries through its unparalleled capabilities for content creation, reasoning, planning, and multimodal understanding. This revolutionary force offers the most promising path yet toward solving one of engineering's grandest challenges: achieving reliable, fully autonomous driving, particularly the pursuit of Level 5 autonomy. This survey delivers a comprehensive and critical synthesis of the emerging role of GenAI across the autonomous driving stack. We begin by distilling the principles and trade-offs of modern generative modeling, encompassing VAEs, GANs, Diffusion Models, and Large Language Models (LLMs). We then map their frontier applications in image, LiDAR, trajectory, occupancy, video generation as well as LLM-guided reasoning and decision making. We categorize practical applications, such as synthetic data workflows, end-to-end driving strategies, high-fidelity digital twin systems, smart transportation networks, and cross-domain transfer to embodied AI. We identify key obstacles and possibilities such as comprehensive generalization across rare cases, evaluation and safety checks, budget-limited implementation, regulatory compliance, ethical concerns, and environmental effects, while proposing research plans across theoretical assurances, trust metrics, transport integration, and socio-technical influence. By unifying these threads, the survey provides a forward-looking reference for researchers, engineers, and policymakers navigating the convergence of generative AI and advanced autonomous mobility. An actively maintained repository of cited works is available at https://github.com/taco-group/GenAI4AD.
Problem

Research questions and friction points this paper is trying to address.

Exploring GenAI's role in achieving Level 5 autonomous driving
Analyzing generative models for synthetic data and decision-making in AD
Addressing challenges like safety, ethics, and scalability in GenAI for AD
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses generative models like VAEs, GANs, Diffusion Models, LLMs
Applies GenAI in synthetic data and digital twins
Addresses safety, ethics, and regulatory compliance challenges
🔎 Similar Papers
No similar papers found.
Y
Yuping Wang
Texas A&M University
Shuo Xing
Shuo Xing
Texas A&M University
Large Language ModelsNatural Language ProcessingMachine Learning
C
Cui Can
Purdue University
R
Renjie Li
Texas A&M University
H
Hongyuan Hua
Texas A&M University
Kexin Tian
Kexin Tian
Texas A&M University
Autonomous DrivingVision-Language Models
Zhaobin Mo
Zhaobin Mo
Columbia University
Physics-informed Deep LearningGenerative Adversarial NetworksReinforcement Learning
Xiangbo Gao
Xiangbo Gao
Texas A&M University
collaborative autonomous drivingmulti-agent systemcomputer vision
Keshu Wu
Keshu Wu
Postdoctoral Research Associate, Texas A&M University
Autonomous DrivingArtificial IntelligenceIntelligent TransportationDigital Twin
Sulong Zhou
Sulong Zhou
Texas A&M University, University of Wisconsin-Madison
GeoAIAI EthicsInformaticsPublic HealthUI/UX and Software Design
H
Hengxu You
University of Florida
Juntong Peng
Juntong Peng
Purdue University
Autonomous DrivingCollaborative PerceptionFoundation ModelIntelligent Transportation
Junge Zhang
Junge Zhang
PhD student, UCR
deep Learningcomputer vision
Z
Zehao Wang
University of California, Riverside
R
Rui Song
Technische Universität München
Mingxuan Yan
Mingxuan Yan
University of California, Riverside
Walter Zimmer
Walter Zimmer
Technical University of Munich (TUM)
Autonomous DrivingComputer VisionIntelligent Transportation SystemsFoundation ModelsRobotics
Xingcheng Zhou
Xingcheng Zhou
Technical University of Munich
Autonomous DrivingVision Language ModelLarge Language ModelComputer Vision
Peiran Li
Peiran Li
The University of Tokyo
Z
Zhaohan Lu
University of Michigan
C
Chia-Ju Chen
University of Texas at Austin
Y
Yue Huang
University of Notre Dame
Ryan A. Rossi
Ryan A. Rossi
Adobe Research
Machine LearningPersonalizationGraph Representation LearningGraph MLGraph Theory
L
Lichao Sun
Lehigh University
Hongkai Yu
Hongkai Yu
Associate Professor, Cleveland State University
Computer VisionMachine LearningIntelligent TransportationIntelligent Vehicles
Z
Zhiwen Fan
Texas A&M University, University of Texas at Austin
F
Frank Hao Yang
Johns Hopkins University
Yuhao Kang
Yuhao Kang
Assistant Professor, GISense Lab, The University of Texas at Austin & GISphere
GIScienceGeospatial Data ScienceGeoAICartographyUrban Visual Intelligence
Ross Greer
Ross Greer
University of California Merced
Artificial IntelligenceMachine VisionAutonomous DrivingHuman-Robot InteractionComputer Music
C
Chenxi Liu
University of Utah
E
Eun Hak Lee
Texas A&M Transportation Institute
X
Xuan Di
Columbia University
X
Xinyue Ye
Texas A&M University
Liu Ren
Liu Ren
VP and Chief Scientist, Scalable and Assistive AI, ADAS AI, Bosch Research North America (BCAI)
Artificial IntelligenceADAS AIVisual AnalyticsComputer Vision
Alois Knoll
Alois Knoll
Technische Universität München
RoboticsAISensor Data FusionAutonomous DrivingCyber Physical Systems
X
Xiaopeng Li
University of Wisconsin-Madison
S
Shuiwang Ji
Texas A&M University
Masayoshi Tomizuka
Masayoshi Tomizuka
Mechaniccal Engineering, University of California
mechanical engineeringdynamic systemscontrolmechatronics
Marco Pavone
Marco Pavone
Stanford University and NVIDIA
RoboticsControl TheoryDistributed ControlIntelligent Transportation systems
Tianbao Yang
Tianbao Yang
Texas A&M University
machine learningstochastic optimization
Jing Du
Jing Du
Postdoctoral Research Fellow at University of New South Wales
Recommender SystemBrain Network AnalysisDisease Spread PredicionTime-Series Analysis
Ming-Hsuan Yang
Ming-Hsuan Yang
University of California at Merced; Google DeepMind
Computer VisionMachine LearningArtificial Intelligence
Hua Wei
Hua Wei
School of Computing and Augmented Intelligence, Arizona State University
Data MiningMachine LearningReinforcement Learning
Ziran Wang
Ziran Wang
Purdue University
Autonomous DrivingDigital TwinHuman-Centered AIHuman-Autonomy TeamingIntelligent Vehicles
Y
Yang Zhou
Texas A&M University
J
Jiachen Li
University of California, Riverside
Zhengzhong Tu
Zhengzhong Tu
Texas A&M University, Google Research, University of Texas at Austin
Agentic AITrustworthy AIEmbodied AI