FCaS: Fine-grained Cardiac Image Synthesis based on 3D Template Conditional Diffusion Model

📅 2025-03-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Addressing challenges in cardiac medical imaging—including difficulty reconstructing fine-grained anatomical structures (e.g., coronary vessels), stringent topological consistency requirements, high 3D morphological heterogeneity, and severe scarcity of annotated data—this paper proposes a Template-guided Bidirectional Conditional Diffusion Model (TCDM). TCDM introduces, for the first time, a 3D template-driven generative mechanism; integrates a deformable mask generation module (MGM) to alleviate reliance on high-quality reference masks; and incorporates a confidence-aware adaptive learning (CAL) strategy that leverages skip-sampling variance (SSV) estimation to optimize downstream segmentation pretraining. Experiments demonstrate that TCDM achieves state-of-the-art performance in both topological fidelity and visual quality, significantly improving segmentation accuracy for cardiac substructures. The method establishes a novel paradigm for few-shot cardiac image synthesis.

Technology Category

Application Category

📝 Abstract
Solving medical imaging data scarcity through semantic image generation has attracted significant attention in recent years. However, existing methods primarily focus on generating whole-organ or large-tissue structures, showing limited effectiveness for organs with fine-grained structure. Due to stringent topological consistency, fragile coronary features, and complex 3D morphological heterogeneity in cardiac imaging, accurately reconstructing fine-grained anatomical details of the heart remains a great challenge. To address this problem, in this paper, we propose the Fine-grained Cardiac image Synthesis(FCaS) framework, established on 3D template conditional diffusion model. FCaS achieves precise cardiac structure generation using Template-guided Conditional Diffusion Model (TCDM) through bidirectional mechanisms, which provides the fine-grained topological structure information of target image through the guidance of template. Meanwhile, we design a deformable Mask Generation Module (MGM) to mitigate the scarcity of high-quality and diverse reference mask in the generation process. Furthermore, to alleviate the confusion caused by imprecise synthetic images, we propose a Confidence-aware Adaptive Learning (CAL) strategy to facilitate the pre-training of downstream segmentation tasks. Specifically, we introduce the Skip-Sampling Variance (SSV) estimation to obtain confidence maps, which are subsequently employed to rectify the pre-training on downstream tasks. Experimental results demonstrate that images generated from FCaS achieves state-of-the-art performance in topological consistency and visual quality, which significantly facilitates the downstream tasks as well. Code will be released in the future.
Problem

Research questions and friction points this paper is trying to address.

Generates fine-grained cardiac images using 3D template diffusion.
Addresses scarcity of high-quality cardiac imaging data.
Improves downstream medical segmentation tasks with adaptive learning.
Innovation

Methods, ideas, or system contributions that make the work stand out.

3D template conditional diffusion model
Deformable Mask Generation Module
Confidence-aware Adaptive Learning strategy
🔎 Similar Papers
No similar papers found.
Jiahao Xia
Jiahao Xia
Research Fellow, University of Technology Sydney
Deep Learning
Y
Yutao Hu
Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China
Yaolei Qi
Yaolei Qi
Southeast University & University of Cambridge
Medical Image AnalysisComputer VisionDeep Learning
Z
Zhenliang Li
Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China
Wenqi Shao
Wenqi Shao
Researcher at Shanghai AI Laboratory
Foundation Model EvaluationLLM CompressionEfficient AdaptationMultimodal Learning
Junjun He
Junjun He
Shanghai Jiao Tong University
Y
Ying Fu
School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
L
Longjiang Zhang
Department of Radiology, Jinling Hospital, Affiliated Hospital of Medical School, Nanjing University, Nanjing, China
G
Guanyu Yang
Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China; Jiangsu Province Joint International Research Laboratory of Medical Information Processing, Southeast University, Nanjing, China; Univ Rennes, CHU Rennes, Inserm, LTSI– UMR 1099, F-35000 Rennes, France