NeRF-based CBCT Reconstruction needs Normalization and Initialization

📅 2025-06-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In neural radiance field (NeRF)-based cone-beam CT (CBCT) reconstruction, training instability, slow convergence, and severe artifacts arise from misalignment between local features encoded by hash tables and global features learned by the neural network. To address this, we propose a normalized hash encoder and a mapping-consistent initialization strategy: the former enforces feature normalization to improve local encoding consistency, while the latter leverages geometric priors from a pre-trained NeRF model to semantically align and initialize network weights. Integrated within the NeRF framework, our approach synergizes hash-based acceleration with transfer learning, requiring only minimal code modifications. Evaluated on 128 CT volumes across four datasets—spanning seven anatomical regions—our method significantly enhances training stability and convergence speed. Quantitatively, reconstruction quality improves by an average of 2.1 dB in PSNR and 0.032 in SSIM, while artifact suppression is markedly strengthened.

Technology Category

Application Category

📝 Abstract
Cone Beam Computed Tomography (CBCT) is widely used in medical imaging. However, the limited number and intensity of X-ray projections make reconstruction an ill-posed problem with severe artifacts. NeRF-based methods have achieved great success in this task. However, they suffer from a local-global training mismatch between their two key components: the hash encoder and the neural network. Specifically, in each training step, only a subset of the hash encoder's parameters is used (local sparse), whereas all parameters in the neural network participate (global dense). Consequently, hash features generated in each step are highly misaligned, as they come from different subsets of the hash encoder. These misalignments from different training steps are then fed into the neural network, causing repeated inconsistent global updates in training, which leads to unstable training, slower convergence, and degraded reconstruction quality. Aiming to alleviate the impact of this local-global optimization mismatch, we introduce a Normalized Hash Encoder, which enhances feature consistency and mitigates the mismatch. Additionally, we propose a Mapping Consistency Initialization(MCI) strategy that initializes the neural network before training by leveraging the global mapping property from a well-trained model. The initialized neural network exhibits improved stability during early training, enabling faster convergence and enhanced reconstruction performance. Our method is simple yet effective, requiring only a few lines of code while substantially improving training efficiency on 128 CT cases collected from 4 different datasets, covering 7 distinct anatomical regions.
Problem

Research questions and friction points this paper is trying to address.

Addresses local-global training mismatch in NeRF-based CBCT reconstruction
Improves feature consistency with Normalized Hash Encoder
Enhances stability via Mapping Consistency Initialization strategy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Normalized Hash Encoder enhances feature consistency
Mapping Consistency Initialization improves training stability
Simple yet effective method boosts reconstruction performance
🔎 Similar Papers
No similar papers found.
Z
Zhuowei Xu
School of Biomedical Engineering, Division of Life Sciences and Medicine, University of Science and Technology of China (USTC), Hefei, 230026, China; Center for Medical Imaging, Robotics, Analytic Computing & Learning (MIRACLE), Suzhou Institute for Advance Research, USTC, Suzhou, 215123, China
H
Han Li
Center for Medical Imaging, Robotics, Analytic Computing & Learning (MIRACLE), Suzhou Institute for Advance Research, USTC, Suzhou, 215123, China; Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; Munich Center for Machine Learning (MCML), Munich, Germany
D
Dai Sun
School of Biomedical Engineering, Division of Life Sciences and Medicine, University of Science and Technology of China (USTC), Hefei, 230026, China; Center for Medical Imaging, Robotics, Analytic Computing & Learning (MIRACLE), Suzhou Institute for Advance Research, USTC, Suzhou, 215123, China
Z
Zhicheng Li
School of Biomedical Engineering, Division of Life Sciences and Medicine, University of Science and Technology of China (USTC), Hefei, 230026, China; Center for Medical Imaging, Robotics, Analytic Computing & Learning (MIRACLE), Suzhou Institute for Advance Research, USTC, Suzhou, 215123, China
Yujia Li
Yujia Li
Research Scientist, Google DeepMind
Machine LearningComputer VisionNatural Language ProcessingOptimization
Qingpeng Kong
Qingpeng Kong
University of California, Berkeley
Medical Image AnalysisAI Fairness
Z
Zhiwei Cheng
School of Biomedical Engineering, Division of Life Sciences and Medicine, University of Science and Technology of China (USTC), Hefei, 230026, China; Center for Medical Imaging, Robotics, Analytic Computing & Learning (MIRACLE), Suzhou Institute for Advance Research, USTC, Suzhou, 215123, China
Nassir Navab
Nassir Navab
Professor of Computer Science, Technische Universität München
S
S. Kevin Zhou
School of Biomedical Engineering, Division of Life Sciences and Medicine, University of Science and Technology of China (USTC), Hefei, 230026, China; Center for Medical Imaging, Robotics, Analytic Computing & Learning (MIRACLE), Suzhou Institute for Advance Research, USTC, Suzhou, 215123, China; Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; Jiangsu Provincial Key Laboratory of Multimodal Digital Twin