H3DE-Net: Efficient and Accurate 3D Landmark Detection in Medical Imaging

📅 2025-02-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Medical 3D images suffer from sparse anatomical landmarks and high dimensionality, making it challenging to simultaneously capture fine-grained local details and model global spatial relationships—leading to a trade-off between accuracy and efficiency. To address this, we propose a novel hybrid network architecture that, for the first time, integrates a lightweight hierarchical routing attention mechanism into a 3D CNN backbone. This enables efficient global contextual modeling and adaptive multi-scale feature fusion. The design significantly reduces computational overhead while improving robustness to missing landmarks and complex anatomical variations. Evaluated on public CT datasets, our method achieves state-of-the-art landmark detection accuracy with substantially fewer parameters and lower inference cost. It reduces mean localization error by 12.7% over existing approaches, with particularly pronounced improvements in low signal-to-noise ratio regions and areas exhibiting structural deformities.

Technology Category

Application Category

📝 Abstract
3D landmark detection is a critical task in medical image analysis, and accurately detecting anatomical landmarks is essential for subsequent medical imaging tasks. However, mainstream deep learning methods in this field struggle to simultaneously capture fine-grained local features and model global spatial relationships, while maintaining a balance between accuracy and computational efficiency. Local feature extraction requires capturing fine-grained anatomical details, while global modeling requires understanding the spatial relationships within complex anatomical structures. The high-dimensional nature of 3D volume further exacerbates these challenges, as landmarks are sparsely distributed, leading to significant computational costs. Therefore, achieving efficient and precise 3D landmark detection remains a pressing challenge in medical image analysis. In this work, We propose a extbf{H}ybrid extbf{3}D extbf{DE}tection extbf{Net}(H3DE-Net), a novel framework that combines CNNs for local feature extraction with a lightweight attention mechanism designed to efficiently capture global dependencies in 3D volumetric data. This mechanism employs a hierarchical routing strategy to reduce computational cost while maintaining global context modeling. To our knowledge, H3DE-Net is the first 3D landmark detection model that integrates such a lightweight attention mechanism with CNNs. Additionally, integrating multi-scale feature fusion further enhances detection accuracy and robustness. Experimental results on a public CT dataset demonstrate that H3DE-Net achieves state-of-the-art(SOTA) performance, significantly improving accuracy and robustness, particularly in scenarios with missing landmarks or complex anatomical variations. We aready open-source our project, including code, data and model weights.
Problem

Research questions and friction points this paper is trying to address.

Efficient 3D landmark detection in medical imaging
Balancing local feature extraction and global spatial modeling
Reducing computational cost while maintaining accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines CNNs with lightweight attention
Hierarchical routing reduces computational cost
Multi-scale feature fusion enhances accuracy
🔎 Similar Papers
No similar papers found.
Z
Zhen Huang
School of Computer Science and Technology, University of Science and Technology of China (USTC), Hefei, 230026, China; School of Information Science and Technology, Eastern Institute of Technology (EIT), Ningbo, 315200, China
R
Ronghao Xu
School of Biomedical Engineering, Division of Life Sciences and Medicine, USTC, Suzhou, 215123, China; Suzhou Institute for Advanced Research, USTC, Suzhou, 215123, China
X
Xiaoqian Zhou
School of Biomedical Engineering, Division of Life Sciences and Medicine, USTC, Suzhou, 215123, China; Suzhou Institute for Advanced Research, USTC, Suzhou, 215123, China
Y
Yangbo Wei
School of Computer Science and Technology, University of Science and Technology of China (USTC), Hefei, 230026, China; Shanghai Jiao Tong University, Shanghai, 200030, China
S
Suhua Wang
Computer Department, Changchun Humanities and Sciences College, Changchun, 130117, China
X
Xiaoxin Sun
School of Information Science and Technology, Northeast Normal University, Changchun, 130117, China
H
Han Li
Computer Aided Medical Procedures (CAMP), School of Computation, Information and Technology, Technische Universitaet Muenchen (TUM), Germany
Qingsong Yao
Qingsong Yao
Stanford University | ICT, CAS
Medical Image ComputingMedical Image Analysis