DeltaDeno: Zero-Shot Anomaly Generation via Delta-Denoising Attribution

📅 2025-11-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address overfitting in zero-shot anomaly generation caused by the absence of authentic anomaly samples—rendering conventional fine-tuning ineffective—this paper proposes a novel diffusion-based paradigm that requires neither training nor access to any anomaly samples. Methodologically, we design a dual-branch contrastive denoising architecture: subtle prompt perturbations induce divergent denoising trajectories, and accumulated denoising residuals enable interpretable, pixel-level anomaly localization. We further enhance generation stability and local controllability via token-level prompt refinement, latent-space localized inpainting, and constrained spatial attention biasing. To our knowledge, this is the first approach achieving high-fidelity, explainable, *fully* zero-shot localized anomaly synthesis—without any anomaly annotations or model adaptation. Extensive experiments on multiple public benchmarks demonstrate significant improvements in downstream anomaly detection performance.

Technology Category

Application Category

📝 Abstract
Anomaly generation is often framed as few-shot fine-tuning with anomalous samples, which contradicts the scarcity that motivates generation and tends to overfit category priors. We tackle the setting where no real anomaly samples or training are available. We propose Delta-Denoising (DeltaDeno), a training-free zero-shot anomaly generation method that localizes and edits defects by contrasting two diffusion branches driven by a minimal prompt pair under a shared schedule. By accumulating per-step denoising deltas into an image-specific localization map, we obtain a mask to guide the latent inpainting during later diffusion steps and preserve the surrounding context while generating realistic local defects. To improve stability and control, DeltaDeno performs token-level prompt refinement that aligns shared content and strengthens anomaly tokens, and applies a spatial attention bias restricted to anomaly tokens in the predicted region. Experiments on public datasets show that DeltaDeno achieves great generation, realism and consistent gains in downstream detection performance. Code will be made publicly available.
Problem

Research questions and friction points this paper is trying to address.

Generating realistic anomalies without training data or real samples
Localizing and editing defects using diffusion model contrast
Improving anomaly detection through zero-shot generation method
Innovation

Methods, ideas, or system contributions that make the work stand out.

Zero-shot anomaly generation without training
Delta-denoising attribution for defect localization
Token-level prompt refinement and spatial attention bias
🔎 Similar Papers
C
Chaoran Xu
Institute of Automation, Chinese Academy of Sciences
C
Chengkan Lv
Institute of Automation, Chinese Academy of Sciences
Qiyu Chen
Qiyu Chen
Institute of Automation, Chinese Academy of Sciences
Anomaly DetectionComputer VisionDeep Learning
Yunkang Cao
Yunkang Cao
Hunan University
Visual Anomaly DetectionIndustrial Foundation ModelEmbodied Intelligence
F
Feng Zhang
Institute of Automation, Chinese Academy of Sciences
Z
Zhengtao Zhang
Institute of Automation, Chinese Academy of Sciences