Innovative Tooth Segmentation Using Hierarchical Features and Bidirectional Sequence Modeling

📅 2026-02-25
🤖 AI Summary
This work addresses two limitations of existing dental image segmentation methods. First, encoders built on fixed-resolution feature maps produce discontinuous boundaries and discriminate poorly between foreground and background. Second, the quadratic cost of Transformer-based self-attention hinders efficient processing of high-resolution images. To overcome these challenges, the authors propose a three-stage hierarchical encoder that integrates multi-scale features to preserve fine structural details. The design incorporates bidirectional sequence modeling and a lightweight global context mechanism, reducing computational complexity while enhancing global spatial awareness. On the OralVision dataset, the proposed method improves mIoU by 1.1% over current state-of-the-art approaches, indicating gains in both segmentation continuity and accuracy.
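To make the complexity argument concrete, here is a minimal sketch of a generic bidirectional linear-time scan. It is not the authors' architecture, which the summary does not specify. The image size, patch size, decay constant, and the function names `token_count`, `scan`, and `bidirectional_scan` are all illustrative assumptions.

```python
def token_count(height, width, patch):
    """Number of patch tokens for an image (assumed non-overlapping square patches)."""
    return (height // patch) * (width // patch)


def scan(seq, decay):
    """Linear-time recurrence h_t = decay * h_{t-1} + x_t over a 1-D sequence."""
    out, h = [], 0.0
    for x in seq:
        h = decay * h + x
        out.append(h)
    return out


def bidirectional_scan(seq, decay=0.5):
    """Combine a left-to-right and a right-to-left scan so that every
    position receives context from both sides of the sequence."""
    fwd = scan(seq, decay)
    bwd = scan(seq[::-1], decay)[::-1]
    # Both passes include the current token, so subtract it once.
    return [f + b - x for f, b, x in zip(fwd, bwd, seq)]


# Hypothetical 1024x1024 image split into 16x16 patches.
n = token_count(1024, 1024, 16)
attention_pairs = n * n   # self-attention compares every token with every token
scan_steps = 2 * n        # one forward pass plus one backward pass

# A single impulse in the middle spreads symmetrically in both directions.
impulse_response = bidirectional_scan([0.0, 1.0, 0.0])
```

Under these assumptions the image yields 4,096 tokens, so self-attention evaluates about 16.8 million token pairs, while the two scans take 8,192 recurrence steps. A real model would replace the fixed scalar decay with learned, input-dependent parameters.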

📝 Abstract
Tooth image segmentation is a cornerstone of dental digitization. However, traditional image encoders relying on fixed-resolution feature maps often lead to discontinuous segmentation and poor discrimination between target regions and background, due to insufficient modeling of environmental and global context. Moreover, transformer-based self-attention introduces substantial computational overhead because of its quadratic complexity (O(n^2)), making it inefficient for high-resolution dental images. To address these challenges, we introduce a three-stage encoder with hierarchical feature representation to capture scale-adaptive information in dental images. By jointly leveraging low-level details and high-level semantics through cross-scale feature fusion, the model effectively preserves fine structural information while maintaining strong contextual awareness. Furthermore, a bidirectional sequence modeling strategy is incorporated to enhance global spatial context understanding without incurring high computational cost. We validate our method on two dental datasets, with experimental results demonstrating its superiority over existing approaches. On the OralVision dataset, our model achieves a 1.1% improvement in mean intersection over union (mIoU).
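For readers unfamiliar with the reported metric, the following is a minimal sketch of mean intersection over union on flattened label maps. It assumes the common convention of skipping classes that are absent from both prediction and ground truth. The abstract does not state which convention the authors use, nor whether the 1.1% gain is absolute or relative. The function name `mean_iou` and the toy labels are illustrative.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection over union for two equal-length sequences of class labels."""
    ious = []
    for c in range(num_classes):
        # Pixels labeled c in both maps.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels labeled c in at least one map.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union:  # assumption: ignore classes that appear in neither map
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else float("nan")


# Toy example: 0 = background, 1 = tooth, four pixels.
pred = [0, 0, 1, 1]
target = [0, 1, 1, 1]
score = mean_iou(pred, target, num_classes=2)
```

In the toy example the background IoU is 1/2 and the tooth IoU is 2/3, so the mean is 7/12, roughly 0.583.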
Problem

Research questions and friction points this paper is trying to address.

tooth segmentation
image segmentation
context modeling
computational complexity
dental imaging
Innovation

Methods, ideas, or system contributions that make the work stand out.

hierarchical feature representation
bidirectional sequence modeling
cross-scale feature fusion
tooth segmentation
computational efficiency
Authors

Xinxin Zhao
Renmin University of China
Jian Jiang
School of Computer and Cyber Sciences, Communication University of China, 100024, Beijing, China; Center of Big Data, China Digital Culture Group Co., Ltd, 100176, Beijing, China
Yan Tian
School of Computer Science and Technology, Zhejiang Gongshang University, 310018, Hangzhou, China; Shining3D Tech Co., Ltd., 311258, Hangzhou, China; Zhejiang Key Laboratory of Big Data and Future E-Commerce Technology, 310018, Hangzhou, China
Liqin Wu
Department of Stomatology, Tongxiang Hospital of Traditional Chinese Medicine, 100024, Tongxiang, China
Zhaocheng Xu
School of Mathematical and Computational Sciences, Massey University, 100024, Auckland, New Zealand
Teddy Yang
Oral and Maxillofacial Surgery at the Faculty of Dentistry, University of Hong Kong, 999077, Hong Kong, China
Yunuo Zou
School of Art Design, Zhejiang Gongshang University, 310018, Hangzhou, China
Xun Wang
School of Computer Science and Technology, Zhejiang Gongshang University, 310018, Hangzhou, China