WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semantic Segmentation

📅 2024-10-15
🏛️ arXiv.org
📈 Citations: 2
Influential: 0
📄 PDF
🤖 AI Summary
Weak generalization of semantic segmentation models across weather domains (e.g., rain, fog, snow) and poor robustness to long-tailed classes (e.g., riders, motorcycles) hinder autonomous driving performance. To address this, we propose a programmatic weather image generation method that synergizes large language models (LLMs) with Stable Diffusion. Our approach innovatively integrates LLM-driven prompt engineering and tail-class-balanced sampling to jointly ensure weather diversity, scene plausibility, and high-fidelity long-tail class synthesis. Furthermore, we design a segmentation-model-agnostic domain generalization training framework compatible with any state-of-the-art (SOTA) segmenter. Evaluated on three cross-domain benchmarks (Cityscapes→ACDC, etc.), our method boosts the HRDA model’s mean Intersection-over-Union (mIoU) by 13.9%, significantly improving robustness to unseen weather conditions.

Technology Category

Application Category

📝 Abstract
In this work, we propose a novel approach, namely WeatherDG, that can generate realistic, weather-diverse, and driving-screen images based on the cooperation of two foundation models, i.e, Stable Diffusion (SD) and Large Language Model (LLM). Specifically, we first fine-tune the SD with source data, aligning the content and layout of generated samples with real-world driving scenarios. Then, we propose a procedural prompt generation method based on LLM, which can enrich scenario descriptions and help SD automatically generate more diverse, detailed images. In addition, we introduce a balanced generation strategy, which encourages the SD to generate high-quality objects of tailed classes under various weather conditions, such as riders and motorcycles. This segmentation-model-agnostic method can improve the generalization ability of existing models by additionally adapting them with the generated synthetic data. Experiments on three challenging datasets show that our method can significantly improve the segmentation performance of different state-of-the-art models on target domains. Notably, in the setting of ''Cityscapes to ACDC'', our method improves the baseline HRDA by 13.9% in mIoU.
Problem

Research questions and friction points this paper is trying to address.

Image Classification
Weather Variability
Rare Object Recognition
Innovation

Methods, ideas, or system contributions that make the work stand out.

WeatherDG
Stable Diffusion
Large Language Model
Chenghao Qian
Chenghao Qian
Ph.D student, University of Leeds | @Parallel Domain | Ex. XPENG
Generative ModelsAutonomous DrivingAdverse Weather
Y
Yuhu Guo
Department of Electrical and Computer Engineering, Carnegie Mellon University, USA
Y
Yuhong Mo
Department of Electrical and Computer Engineering, Carnegie Mellon University, USA
W
Wenjing Li
Transport Studies at Institute at University of Leeds, UK