Sat-DN: Implicit Surface Reconstruction from Multi-View Satellite Images with Depth and Normal Supervision

📅 2025-02-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address geometric distortions in implicit surface reconstruction from multi-view satellite imagery—caused by low texture, illumination variations, and façade occlusions—this paper proposes a progressive multi-resolution hash grid architecture. Methodologically, it integrates explicit depth map supervision with surface normal consistency regularization, leveraging hash-encoded neural implicit representations and a progressive frequency-uplifting training strategy to enable coarse-to-fine geometric modeling. Compared to conventional stereo matching approaches, the method achieves significantly higher geometric fidelity, especially in fine-scale details; relative to NeRF-based methods, it accelerates training substantially while mitigating overfitting. Evaluated on the DFC2019 dataset, the approach attains state-of-the-art performance, demonstrating superior building contour sharpness and planar structural accuracy over existing methods.

Technology Category

Application Category

📝 Abstract
With advancements in satellite imaging technology, acquiring high-resolution multi-view satellite imagery has become increasingly accessible, enabling rapid and location-independent ground model reconstruction. However, traditional stereo matching methods struggle to capture fine details, and while neural radiance fields (NeRFs) achieve high-quality reconstructions, their training time is prohibitively long. Moreover, challenges such as low visibility of building facades, illumination and style differences between pixels, and weakly textured regions in satellite imagery further make it hard to reconstruct reasonable terrain geometry and detailed building facades. To address these issues, we propose Sat-DN, a novel framework leveraging a progressively trained multi-resolution hash grid reconstruction architecture with explicit depth guidance and surface normal consistency constraints to enhance reconstruction quality. The multi-resolution hash grid accelerates training, while the progressive strategy incrementally increases the learning frequency, using coarse low-frequency geometry to guide the reconstruction of fine high-frequency details. The depth and normal constraints ensure a clear building outline and correct planar distribution. Extensive experiments on the DFC2019 dataset demonstrate that Sat-DN outperforms existing methods, achieving state-of-the-art results in both qualitative and quantitative evaluations. The code is available at https://github.com/costune/SatDN.
Problem

Research questions and friction points this paper is trying to address.

Reconstruct terrain geometry from satellite images
Enhance building facade details in reconstructions
Accelerate training of high-quality satellite image models
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-resolution hash grid
Depth and normal constraints
Progressive training strategy
Tianle Liu
Tianle Liu
Ph.D. in statistics at Harvard University
applied probabilitystatistical inferencemachine learning
S
Shuangming Zhao
School of Remote Sensing Information Engineering, Wuhan University, Wuhan, China
W
Wanshou Jiang
State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, China
B
Bingxuan Guo
State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, China