Anatomy-Guided Representation Learning Using a Transformer-Based Network for Thyroid Nodule Segmentation in Ultrasound Images

📅 2025-12-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Thyroid nodule ultrasound image segmentation faces three major challenges: ambiguous boundaries, highly variable nodule sizes, and severe scarcity of annotated data—leading to weak contextual modeling and poor generalization in existing models. To address these, we propose the first semi-supervised multi-task Transformer framework specifically designed for this task. Our method innovatively incorporates anatomical priors of the thyroid gland and jointly optimizes three complementary objectives: nodule segmentation, thyroid gland segmentation, and nodule size estimation. Key technical components include a hierarchical Transformer encoder, semi-supervised pretraining with consistency regularization, local-global feature fusion, and multi-task collaborative optimization—collectively enhancing boundary discrimination and scale robustness. On the TN3K and DDTI benchmarks, our approach achieves state-of-the-art Dice scores and superior cross-dataset generalization, demonstrating strong clinical applicability.

Technology Category

Application Category

📝 Abstract
Accurate thyroid nodule segmentation in ultrasound images is critical for diagnosis and treatment planning. However, ambiguous boundaries between nodules and surrounding tissues, size variations, and the scarcity of annotated ultrasound data pose significant challenges for automated segmentation. Existing deep learning models struggle to incorporate contextual information from the thyroid gland and generalize effectively across diverse cases. To address these challenges, we propose SSMT-Net, a Semi-Supervised Multi-Task Transformer-based Network that leverages unlabeled data to enhance Transformer-centric encoder feature extraction capability in an initial unsupervised phase. In the supervised phase, the model jointly optimizes nodule segmentation, gland segmentation, and nodule size estimation, integrating both local and global contextual features. Extensive evaluations on the TN3K and DDTI datasets demonstrate that SSMT-Net outperforms state-of-the-art methods, with higher accuracy and robustness, indicating its potential for real-world clinical applications.
Problem

Research questions and friction points this paper is trying to address.

Segment thyroid nodules in ultrasound images accurately
Address ambiguous boundaries and size variations challenges
Incorporate contextual information for better generalization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer-based network for thyroid nodule segmentation
Semi-supervised multi-task learning with unlabeled data
Joint optimization of segmentation and size estimation tasks
🔎 Similar Papers
No similar papers found.
M
Muhammad Umar Farooq
Department of Computer Science, Hanyang University, Seoul, 04762, South Korea
A
Abd Ur Rehman
Department of Computer Science, The University of Alabama, Seoul, 04762, South Korea
A
Azka Rehman
Department of Biomedical Sciences, Seoul National University, Seoul, 08826, South Korea
M
Muhammad Usman
Department of Anesthesiology, Perioperative and Pain Medicine, Stanford University, CA 94305, USA
Dong-Kyu Chae
Dong-Kyu Chae
Assistant Professor, Hanyang University
recommender systemsdeep learningdata mining
Junaid Qadir
Junaid Qadir
Professor of Computer Engineering, Qatar University
Human-centered AIAI EthicsEngineering EducationAI in EducationHealthcare AI