Cross-Source Supervision for Bone Infection Segmentation in Dual-Modality PET-CT

📅 2026-05-10
📈 Citations: 0
Influential: 0
📄 PDF

career value

170K/year
🤖 AI Summary
This study addresses the challenge of segmenting bone infections in PET-CT images, where ambiguous lesion boundaries and inter-expert annotation inconsistencies hinder reliable delineation. To overcome this, the authors propose an end-to-end dual-modality segmentation framework that fuses PET metabolic and CT bone-window anatomical information at an early stage. A decoupled dual-source supervision strategy is introduced, enabling parallel training on independent expert annotations aligned with distinct clinical intents—high sensitivity versus high specificity—thereby avoiding forced consensus. Patient-level 3D voxel evaluation combined with rigorous cross-validation mitigates slice-wise correlation bias under limited sample sizes. Experimental results demonstrate that the model effectively internalizes divergent diagnostic philosophies, significantly enhancing segmentation robustness and clinical applicability at the patient level (reported as mean ± standard deviation).
📝 Abstract
Early and accurate diagnosis and lesion localization of bone infections are crucial for clinical treatment. PET-CT integrates anatomical information from CT with metabolic information from PET, making it an important imaging modality for diagnosing bone infections. However, accurate lesion segmentation remains challenging due to indistinct lesion boundaries and inconsistencies in annotations generated by different experts or automated systems. In this work, we investigate multimodal segmentation of bone infections under annotation discrepancy. We develop a bimodal end-to-end segmentation framework that integrates PET metabolic signals and CT bone-window anatomy through an early-fusion multimodal representation.To mitigate performance inflation caused by inter-slice correlation in small datasets, this study discards traditional two-dimensional evaluation methods and implements a rigorous patient-level 3D volumetric evaluation and cross-validation. Furthermore, instead of forcing a singular consensus, we propose a decoupled dual-source learning framework where parallel models are trained on independent expert annotations driven by high-sensitivity and high-specificity clinical intents. Experimental results objectively report performance variations at the patient level (Mean + SD and Mean - SD), demonstrating the effectiveness of multimodal PET-CT fusion. The cross-evaluation matrix quantitatively reveals how models successfully internalize distinct expert diagnostic philosophies, providing a robust, diversity-preserving paradigm for clinical AI deployment in bone infection segmentation.
Problem

Research questions and friction points this paper is trying to address.

bone infection segmentation
PET-CT
annotation discrepancy
multimodal imaging
lesion localization
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal fusion
cross-source supervision
3D volumetric evaluation
decoupled dual-source learning
bone infection segmentation
🔎 Similar Papers
No similar papers found.
Zonglin Yang
Zonglin Yang
Ph.D. in Computer Science, Nanyang Technological University
Natural Language ProcessingLLMs for Scientific DiscoveryLarge Reasoning Models
X
Xiaolei Diao
University College London
J
Jishizhan Chen
University College London
X
Xiaozhuang Man
Shanghai Sixth People’s Hospital
Wei Kong
Wei Kong
Westlake University
G
Gen Wen
Shanghai Sixth People’s Hospital
P
Pengfei Cheng
Shanghai Sixth People’s Hospital, Affiliated to SJTU School of Medicine
Daqian Shi
Daqian Shi
University of Trento, UCL, QMUL