CorrectAD: A Self-Correcting Agentic System to Improve End-to-end Planning in Autonomous Driving

📅 2025-11-17

🤖 AI Summary

To address the limited robustness of end-to-end autonomous driving planners in long-tail scenarios, this paper proposes CorrectAD, a fully automated self-correcting system built on a world model (a diffusion-based video generator). The system has three parts: (1) PM-Agent, an agent that plays the role of a product manager and formulates structured data requirements for data similar to the observed failure cases; (2) DriveSora, a diffusion-based video generation model that produces spatiotemporally consistent driving videos aligned with 3D layouts; and (3) a pipeline that combines the two, so that generated data, which comes with its 3D annotations, is used to correct the planner. Evaluated on nuScenes and a more challenging in-house dataset across multiple end-to-end planners, the system corrects 62.5% and 49.8% of planner failure cases, respectively, and reduces collision rates by 39% and 27%. The pipeline is model-agnostic and can be applied to any end-to-end planner.
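To make the reported percentages concrete, here is a minimal sketch of how such figures are commonly computed. It assumes that "corrects X% of failure cases" means the share of previously failing cases that pass after correction, and that the collision-rate figure is a relative reduction; the paper summary does not spell out either definition. The function names and the input numbers are hypothetical and chosen only to reproduce the arithmetic, not taken from the paper.

```python
def correction_rate(num_failures_before: int, num_corrected: int) -> float:
    """Share of previously failing cases that no longer fail (assumed definition)."""
    if num_failures_before <= 0:
        raise ValueError("num_failures_before must be positive")
    return num_corrected / num_failures_before


def relative_reduction(rate_before: float, rate_after: float) -> float:
    """Relative drop of a rate, e.g. collision rate (assumed definition)."""
    if rate_before <= 0:
        raise ValueError("rate_before must be positive")
    return (rate_before - rate_after) / rate_before


# Illustrative numbers only: 5 of 8 failures fixed, collision rate 1.00% -> 0.61%.
print(f"{correction_rate(8, 5):.1%}")           # 62.5%
print(f"{relative_reduction(1.00, 0.61):.0%}")  # 39%
```

Under these assumed definitions, a 39% reduction describes the change relative to the original collision rate, not a drop of 39 percentage points.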

📝 Abstract

End-to-end planning methods are the de facto standard in current autonomous driving systems, but the robustness of these data-driven approaches suffers from the notorious long-tail problem (i.e., rare but safety-critical failure cases). In this work, we explore whether recent diffusion-based video generation methods (a.k.a. world models), paired with structured 3D layouts, can enable a fully automated pipeline that self-corrects such failure cases. We first introduce an agent that simulates the role of a product manager, dubbed PM-Agent, which formulates data requirements for collecting data similar to the failure cases. We then use a generative model that can simulate both data collection and annotation. However, existing generative models struggle to generate high-fidelity data conditioned on 3D layouts. To address this, we propose DriveSora, which generates spatiotemporally consistent videos aligned with the 3D annotations requested by PM-Agent. We integrate these components into our self-correcting agentic system, CorrectAD. Importantly, our pipeline is agnostic to the end-to-end model and can be applied to improve any end-to-end planner. Evaluated on both nuScenes and a more challenging in-house dataset across multiple end-to-end planners, CorrectAD corrects 62.5% and 49.8% of failure cases, reducing collision rates by 39% and 27%, respectively.
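The closed loop described in the abstract (collect failure cases, have an agent turn them into data requirements, generate matching data, retrain the planner) can be sketched roughly as follows. This is an illustrative outline under stated assumptions, not the authors' implementation: every class and function name here is hypothetical, the "agent" is reduced to counting failures per scenario tag, and the "generator" returns placeholder records where a model such as DriveSora would produce layout-conditioned videos.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class FailureCase:
    scene_id: str
    scenario: str  # hypothetical scenario tag, e.g. "cut_in"


@dataclass(frozen=True)
class DataRequirement:
    scenario: str
    num_clips: int


@dataclass(frozen=True)
class SyntheticClip:
    scenario: str
    layout_id: str  # placeholder for the 3D layout the video is conditioned on


def formulate_requirements(failures: List[FailureCase],
                           clips_per_failure: int = 2) -> List[DataRequirement]:
    """Stand-in for PM-Agent: request more data for each failing scenario."""
    counts = Counter(f.scenario for f in failures)
    return [DataRequirement(s, n * clips_per_failure)
            for s, n in sorted(counts.items())]


def generate_clips(req: DataRequirement) -> List[SyntheticClip]:
    """Stand-in for a layout-conditioned video generator."""
    return [SyntheticClip(req.scenario, f"{req.scenario}-layout-{i}")
            for i in range(req.num_clips)]


def self_correct_round(failures: List[FailureCase],
                       finetune: Callable[[List[SyntheticClip]], None]
                       ) -> List[SyntheticClip]:
    """One round: failures -> requirements -> generated data -> fine-tuning."""
    requirements = formulate_requirements(failures)
    clips = [clip for req in requirements for clip in generate_clips(req)]
    finetune(clips)  # planner-specific training step, supplied by the caller
    return clips


failures = [FailureCase("s1", "cut_in"), FailureCase("s2", "cut_in"),
            FailureCase("s3", "night_pedestrian")]
training_log: List[SyntheticClip] = []  # stands in for a planner's training set
new_clips = self_correct_round(failures, training_log.extend)
print(len(new_clips))  # 6
```

Passing the fine-tuning step in as a callable mirrors the model-agnostic claim: the loop never inspects the planner, so any end-to-end planner that can be trained on the generated clips could be plugged in.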

Problem

Research questions and friction points this paper is trying to address.

Improving robustness of data-driven autonomous driving planning systems

Addressing long-tail failure cases in end-to-end planning methods

Automating self-correction of safety-critical driving failures

Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-correcting agentic system for autonomous driving

Diffusion-based video generation with 3D layouts

Automated pipeline to correct planning failures
