LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset

📅 2026-03-24
📈 Citations: 0
Influential: 0
📄 PDF
📝 Abstract
In real-world domains such as self-driving, generalization to rare scenarios remains a fundamental challenge. To address this, we introduce a new dataset designed for end-to-end driving that focuses on long-tail driving events. We provide multi-view video data, trajectories, high-level instructions, and detailed reasoning traces, facilitating in-context learning and few-shot generalization. The resulting benchmark for multimodal models, such as VLMs and VLAs, goes beyond safety and comfort metrics by evaluating instruction following and semantic coherence between model outputs. The multilingual reasoning traces in English, Spanish, and Chinese are from domain experts with diverse cultural backgrounds. Thus, our dataset is a unique resource for studying how different forms of reasoning affect driving competence. Our dataset is available at: https://hf.co/datasets/kit-mrt/kitscenes-longtail
Problem

Research questions and friction points this paper is trying to address.

long-tail scenarios
autonomous driving
generalization
rare events
multimodal reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

long-tail driving scenarios
reasoning traces
multimodal benchmark
in-context learning
multilingual dataset
🔎 Similar Papers
No similar papers found.
Royden Wagner
Royden Wagner
PhD student at KIT
GenAIself-supervised learningmechanistic interpretability
Ömer Şahin Taş
Ömer Şahin Taş
Karlsruhe Institute of Technology (KIT) & FZI Research Center for Information Technology
Reinforcement LearningSelf-Supervised LearningAutonomous Driving
J
Jaime Villa
University Charles III of Madrid
Felix Hauser
Felix Hauser
PhD student, research assistant, Karlsruhe Institute of Technology (KIT)
Machine learningExplainable AI
Yinzhe Shen
Yinzhe Shen
PhD student at KIT
end-to-end autonomous driving
Marlon Steiner
Marlon Steiner
Research Associate at KIT
motion planningmotion predictionmachine learningreinforcement learning
D
Dominik Strutz
Karlsruhe Institute of Technology (KIT)
C
Carlos Fernandez
Karlsruhe Institute of Technology (KIT)
C
Christian Kinzig
Karlsruhe Institute of Technology (KIT)
G
Guillermo S. Guitierrez-Cabello
Technical University of Madrid
Hendrik Königshof
Hendrik Königshof
FZI Research Center for Information Technology | Karlsruhe Institute of Technology (KIT)
Environment PerceptionLocalizationPlanning
Fabian Immel
Fabian Immel
PhD Student, Karlsruhe Institute of Technology / Research Scientist at FZI Forschungszentrum
Autonomous SystemsMachine LearningComputer Vision
R
Richard Schwarzkopf
Karlsruhe Institute of Technology (KIT)
N
Nils Alexander Rack
Karlsruhe Institute of Technology (KIT)
Kevin Rösch
Kevin Rösch
FZI Forschungszentrum Informatik
Data augmentationAutomated drivingSynthetic data generation
K
Kaiwen Wang
Karlsruhe Institute of Technology (KIT)
Jan-Hendrik Pauls
Jan-Hendrik Pauls
Research Group Leader Mapping and Localization, Karlsruhe Institute of Technology (KIT)
MappingLocalizationMap Learning
Martin Lauer
Martin Lauer
Karlsruhe Institute of Technology
autonomous drivingmachine learningcomputer vision
Igor Gilitschenski
Igor Gilitschenski
Assistant Professor, University of Toronto
RoboticsMachine LearningComputer Vision
Holger Caesar
Holger Caesar
Assistant Professor at TU Delft
Autonomous VehiclesComputer Vision
Christoph Stiller
Christoph Stiller
Professor of Measurement and Control Systems, Karlsruher Institut für Technologie, KIT, FZI
intelligent vehiclesautonomous drivingautonomous vehiclesintelligent transportation systems