Structure and Progress Aware Diffusion for Medical Image Segmentation

📅 2026-03-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Medical image segmentation often suffers from performance degradation due to ambiguous boundaries and noise interference. To address this challenge, this work proposes a Structure- and Progress-Aware Diffusion framework (SPAD), which models the segmentation process in two stages: Semantic-concentrated Diffusion (ScD) and Boundary-concentrated Diffusion (BcD). The early stage focuses on preserving stable semantic structures, while the later stage refines boundary details. SPAD introduces semantic anchor-preserving perturbations and a progress-aware boundary noise mechanism, complemented by a progress-aware scheduler that enables gradual learning from structural understanding to boundary refinement. Experimental results demonstrate that SPAD significantly improves segmentation accuracy and exhibits enhanced robustness and generalization, particularly in challenging scenarios involving tumors and lesions with ill-defined boundaries.

Technology Category

Application Category

📝 Abstract
Medical image segmentation is crucial for computer-aided diagnosis, which necessitates understanding both coarse morphological and semantic structures, as well as carving fine boundaries. The morphological and semantic structures in medical images are beneficial and stable clues for target understanding. While the fine boundaries of medical targets (like tumors and lesions) are usually ambiguous and noisy since lesion overlap, annotation uncertainty, and so on, making it not reliable to serve as early supervision. However, existing methods simultaneously learn coarse structures and fine boundaries throughout the training process. In this paper, we propose a structure and progress-aware diffusion (SPAD) for medical image segmentation, which consists of a semantic-concentrated diffusion (ScD) and a boundary-centralized diffusion (BcD) modulated by a progress-aware scheduler (PaS). Specifically, the semantic-concentrated diffusion introduces anchor-preserved target perturbation, which perturbs pixels within a medical target but preserves unaltered areas as semantic anchors, encouraging the model to infer noisy target areas from the surrounding semantic context. The boundary-centralized diffusion introduces progress-aware boundary noise, which blurs unreliable and ambiguous boundaries, thus compelling the model to focus on coarse but stable anatomical morphology and global semantics. Furthermore, the progress-aware scheduler gradually modulates noise intensity of the ScD and BcD forming a coarse-to-fine diffusion paradigm, which encourage focusing on coarse morphological and semantic structures during early target understanding stages and gradually shifting to fine target boundaries during later contour adjusting stages.
Problem

Research questions and friction points this paper is trying to address.

medical image segmentation
coarse-to-fine learning
ambiguous boundaries
semantic structure
annotation uncertainty
Innovation

Methods, ideas, or system contributions that make the work stand out.

structure-aware diffusion
progress-aware scheduling
semantic-concentrated diffusion
boundary-centralized diffusion
coarse-to-fine segmentation
🔎 Similar Papers
No similar papers found.
Siyuan Song
Siyuan Song
Associate Professor, Arizona State University
Construction Safety and HealthWorkforce DevelopmentAI in ConstructionEngineering Education
G
Guyue Hu
School of Artificial Intelligence, Anhui University, 230601, Hefei, China; State Key Laboratory of Opto-Electronic Information Acquisition and Protection Technology, Anhui University, Hefei, 230601, China; Anhui Provincial Key Laboratory of Security Artificial Intelligence, Anhui University, Hefei, 230601, China; Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, Anhui University, 230601, Hefei, China
Chenglong Li
Chenglong Li
Professor, The University of Florida
Drug DesignDrug DiscoveryMolecular RecognitionMolecular ModelingProtein structure and Dynamics
Dengdi Sun
Dengdi Sun
Anhui University
Machine LearningComputer Vision
Z
Zhe Jin
School of Artificial Intelligence, Anhui University, 230601, Hefei, China; Anhui Provincial Key Laboratory of Security Artificial Intelligence, Anhui University, Hefei, 230601, China; Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, Anhui University, 230601, Hefei, China
Jin Tang
Jin Tang
Anhui University
Computer visionintelligent video analysis