PRO: Projection Domain Synthesis for CT Imaging

πŸ“… 2025-06-16
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address the dual challenges of scarce annotated data and physical modeling inaccuracies in CT image synthesis, this paper proposes the Projection-domain Diffusion Model (PDM)β€”the first diffusion-based framework that tightly integrates CT imaging physics priors with anatomical semantic text prompts for end-to-end, controllable cross-modal generation. Unlike conventional image-domain approaches, PDM operates directly in the raw projection domain to avoid error accumulation from intermediate reconstructions. It introduces a novel text-projection joint conditioning mechanism to enable structure-preserving and anatomy-consistent synthesis with explicit control. Experiments demonstrate that, under low-dose and sparse-view reconstruction settings, PDM achieves an average 2.1 dB PSNR improvement in downstream tasks using only few-shot training data. Moreover, the model exhibits strong multi-task generalization capability, establishing itself as a foundational generative model for CT. The implementation is publicly released to facilitate practical medical image data augmentation.

Technology Category

Application Category

πŸ“ Abstract
Synthesizing high quality CT images remains a signifi-cant challenge due to the limited availability of annotat-ed data and the complex nature of CT imaging. In this work, we present PRO, a novel framework that, to the best of our knowledge, is the first to perform CT image synthesis in the projection domain using latent diffusion models. Unlike previous approaches that operate in the image domain, PRO learns rich structural representa-tions from raw projection data and leverages anatomi-cal text prompts for controllable synthesis. This projec-tion domain strategy enables more faithful modeling of underlying imaging physics and anatomical structures. Moreover, PRO functions as a foundation model, capa-ble of generalizing across diverse downstream tasks by adjusting its generative behavior via prompt inputs. Experimental results demonstrated that incorporating our synthesized data significantly improves perfor-mance across multiple downstream tasks, including low-dose and sparse-view reconstruction, even with limited training data. These findings underscore the versatility and scalability of PRO in data generation for various CT applications. These results highlight the potential of projection domain synthesis as a powerful tool for data augmentation and robust CT imaging. Our source code is publicly available at: https://github.com/yqx7150/PRO.
Problem

Research questions and friction points this paper is trying to address.

Synthesizing high-quality CT images with limited annotated data
Performing CT image synthesis in the projection domain
Improving performance in low-dose and sparse-view reconstruction tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses latent diffusion models for CT synthesis
Operates in projection domain for better accuracy
Leverages anatomical text prompts for control
πŸ”Ž Similar Papers
No similar papers found.
K
Kang Chen
School of Mathematics and Computer Sciences, Nanchang University, Nanchang 330031, China
B
Bin Huang
School of Mathematics and Computer Sciences, Nanchang University, Nanchang 330031, China
X
Xuebin Yang
School of Mathematics and Computer Sciences, Nanchang University, Nanchang 330031, China
Junyan Zhang
Junyan Zhang
National University of Singapore
Large Language Model
Qiegen Liu
Qiegen Liu
Nanchang university
medical imagingimage processing