Anatomy-Guided Representation Learning Using a Transformer-Based Network for Thyroid Nodule Segmentation in Ultrasound Images

📅 2025-12-14
📈 Citations: 0
Influential: 0
📄 PDF

career value

163K/year
🤖 AI Summary
Thyroid nodule ultrasound image segmentation faces three major challenges: ambiguous boundaries, highly variable nodule sizes, and severe scarcity of annotated data—leading to weak contextual modeling and poor generalization in existing models. To address these, we propose the first semi-supervised multi-task Transformer framework specifically designed for this task. Our method innovatively incorporates anatomical priors of the thyroid gland and jointly optimizes three complementary objectives: nodule segmentation, thyroid gland segmentation, and nodule size estimation. Key technical components include a hierarchical Transformer encoder, semi-supervised pretraining with consistency regularization, local-global feature fusion, and multi-task collaborative optimization—collectively enhancing boundary discrimination and scale robustness. On the TN3K and DDTI benchmarks, our approach achieves state-of-the-art Dice scores and superior cross-dataset generalization, demonstrating strong clinical applicability.

Technology Category

Application Category

📝 Abstract
Accurate thyroid nodule segmentation in ultrasound images is critical for diagnosis and treatment planning. However, ambiguous boundaries between nodules and surrounding tissues, size variations, and the scarcity of annotated ultrasound data pose significant challenges for automated segmentation. Existing deep learning models struggle to incorporate contextual information from the thyroid gland and generalize effectively across diverse cases. To address these challenges, we propose SSMT-Net, a Semi-Supervised Multi-Task Transformer-based Network that leverages unlabeled data to enhance Transformer-centric encoder feature extraction capability in an initial unsupervised phase. In the supervised phase, the model jointly optimizes nodule segmentation, gland segmentation, and nodule size estimation, integrating both local and global contextual features. Extensive evaluations on the TN3K and DDTI datasets demonstrate that SSMT-Net outperforms state-of-the-art methods, with higher accuracy and robustness, indicating its potential for real-world clinical applications.
Problem

Research questions and friction points this paper is trying to address.

Segment thyroid nodules in ultrasound images accurately
Address ambiguous boundaries and size variations challenges
Incorporate contextual information for better generalization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer-based network for thyroid nodule segmentation
Semi-supervised multi-task learning with unlabeled data
Joint optimization of segmentation and size estimation tasks
🔎 Similar Papers
No similar papers found.