🤖 AI Summary
Image generation models frequently suffer from “hallucinations”—semantic distortions or structurally implausible outputs—arising from path deviations in flow matching (FM). To address this, we propose an iterative path correction and progressive refinement framework, the first to systematically integrate iterative optimization into the FM paradigm. Our method employs three core components: reweighted path optimization, gradient-guided dynamic correction, and multi-stage distribution alignment, enabling continuous trajectory refinement throughout generation. The framework is plug-and-play, fully compatible with existing FM models without architectural modification. Extensive experiments demonstrate significant hallucination suppression across multiple benchmarks, yielding 15–22% FID improvement over strong baselines. Crucially, our approach preserves high sampling efficiency and training stability. By explicitly modeling and correcting trajectory deviations, it establishes a new paradigm for enhancing generation fidelity and robustness in flow-based diffusion models.
📝 Abstract
Generative models for image generation are now commonly used for a wide variety of applications, ranging from guided image generation for entertainment to solving inverse problems. Nonetheless, training a generator is a non-trivial feat that requires fine-tuning and can lead to so-called hallucinations, that is, the generation of images that are unrealistic. In this work, we explore image generation using flow matching. We explain and demonstrate why flow matching can generate hallucinations, and propose an iterative process to improve the generation process. Our iterative process can be integrated into virtually $ extit{any}$ generative modeling technique, thereby enhancing the performance and robustness of image synthesis systems.