GroundingAnomaly: Spatially-Grounded Diffusion for Few-Shot Anomaly Synthesis

📅 2026-04-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of limited real anomalous samples in industrial visual inspection, which hinders the performance of anomaly detection methods. Existing synthesis approaches often suffer from poorly integrated anomalies or inaccurate masks, limiting their effectiveness. To overcome these issues, the authors propose a spatially guided diffusion model framework that leverages semantic maps to precisely control the location and morphology of synthesized anomalies. By incorporating a spatial conditioning module and a gated self-attention mechanism into a frozen U-Net, the method enables pixel-level semantic guidance and efficient conditioning while preserving pre-trained priors and supporting few-shot adaptation. The approach generates high-quality anomalous samples on MVTec AD and VisA benchmarks and achieves state-of-the-art performance in anomaly detection, segmentation, and instance-level tasks.
📝 Abstract
The performance of visual anomaly inspection in industrial quality control is often constrained by the scarcity of real anomalous samples. Consequently, anomaly synthesis techniques have been developed to enlarge training sets and enhance downstream inspection. However, existing methods either suffer from poor integration caused by inpainting or fail to provide accurate masks. To address these limitations, we propose GroundingAnomaly, a novel few-shot anomaly image generation framework. Our framework introduces a Spatial Conditioning Module that leverages per-pixel semantic maps to enable precise spatial control over the synthesized anomalies. Furthermore, a Gated Self-Attention Module is designed to inject conditioning tokens into a frozen U-Net via gated attention layers. This carefully preserves pretrained priors while ensuring stable few-shot adaptation. Extensive evaluations on the MVTec AD and VisA datasets demonstrate that GroundingAnomaly generates high-quality anomalies and achieves state-of-the-art performance across multiple downstream tasks, including anomaly detection, segmentation, and instance-level detection.
Problem

Research questions and friction points this paper is trying to address.

anomaly synthesis
few-shot learning
industrial quality control
anomaly detection
spatial grounding
Innovation

Methods, ideas, or system contributions that make the work stand out.

Spatial Conditioning
Gated Self-Attention
Few-Shot Anomaly Synthesis
Diffusion Model
Anomaly Segmentation
🔎 Similar Papers
No similar papers found.
Y
Yishen Liu
Beijing Institute of Technology
H
Hongcang Chen
Beijing Institute of Technology
Pengcheng Zhao
Pengcheng Zhao
University of Michigan
Control theoryoptimal control
Y
Yunfan Bao
Beijing Institute of Technology
Y
Yuxi Tian
Beijing Institute of Technology
J
Jieming Zhang
Li Auto
H
Hao Chen
Li Auto
Z
Zheng Zhi
Li Auto
Y
Yongchun Liu
Li Auto
Ying Li
Ying Li
Associate Professor, Beijing Institute of Technology
autonomous drivingflying vehicle perceptionrobot navigation
Dongpu Cao
Dongpu Cao
Professor, Tsinghua University
Automated DrivingVehicle ControlVehicle DynamicsHuman-Centered AI