Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion

📅 2024-12-26
📈 Citations: 0
Influential: 0
🤖 AI Summary
To address the limitations that sparse, irregular, and ambiguous depth maps impose on RGB-D fusion, this paper proposes a degradation-aware paradigm that reformulates depth completion as depth enhancement through selective high-frequency compensation. First, the method densifies the sparse depth with non-CNN densification tools to obtain a coarse but dense depth map. Second, it learns, in a self-supervised manner, an implicit degradation between the coarse depth and the target dense depth, and uses that degradation to adaptively select high-frequency RGB components (e.g., edges) that compensate for the coarse depth. Third, it feeds the degradation into a multi-modal conditional Mamba that dynamically generates the state parameters for efficient global high-frequency information interaction. According to the authors, this is the first framework to recast depth completion as depth enhancement. The authors report state-of-the-art performance on four benchmark datasets (NYUv2, DIML, SUN RGB-D, and TOFDC).
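The densification step can be illustrated with a minimal sketch. The source does not say which non-CNN tool SigNet uses, so the nearest-valid-pixel fill below is an assumed stand-in, and the function name `densify_nearest` and the `missing` sentinel are hypothetical.

```python
from collections import deque

def densify_nearest(sparse, missing=0.0):
    """Fill each missing pixel with the value of a nearest valid pixel.

    Assumption: this multi-source breadth-first fill is only a stand-in for
    the unspecified "non-CNN densification tools" named in the source.
    Distance is measured in 4-connected grid steps; ties are broken by
    visiting order.
    """
    h, w = len(sparse), len(sparse[0])
    dense = [row[:] for row in sparse]  # copy so the input stays untouched
    # Seed the queue with every pixel that already has a depth value.
    queue = deque((r, c) for r in range(h) for c in range(w)
                  if sparse[r][c] != missing)
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < h and 0 <= nc < w and dense[nr][nc] == missing:
                dense[nr][nc] = dense[r][c]  # propagate the nearest value
                queue.append((nr, nc))
    return dense

sparse = [[1.0, 0.0, 0.0],
          [0.0, 0.0, 0.0],
          [0.0, 0.0, 3.0]]
dense = densify_nearest(sparse)
```

The result is dense but blocky, with no sharp structure of its own, which is the kind of coarse depth that the enhancement stage is meant to refine.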

📝 Abstract
In this paper, we introduce the Selective Image Guided Network (SigNet), a novel degradation-aware framework that transforms depth completion into depth enhancement for the first time. Moving beyond direct completion using convolutional neural networks (CNNs), SigNet initially densifies sparse depth data through non-CNN densification tools to obtain coarse yet dense depth. This approach eliminates the mismatch and ambiguity caused by direct convolution over irregularly sampled sparse data. Subsequently, SigNet redefines completion as enhancement, establishing a self-supervised degradation bridge between the coarse depth and the targeted dense depth for effective RGB-D fusion. To achieve this, SigNet leverages the implicit degradation to adaptively select high-frequency components (e.g., edges) of RGB data to compensate for the coarse depth. This degradation is further integrated into a multi-modal conditional Mamba, dynamically generating the state parameters to enable efficient global high-frequency information interaction. We conduct extensive experiments on the NYUv2, DIML, SUN RGBD, and TOFDC datasets, demonstrating the state-of-the-art (SOTA) performance of SigNet.
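The idea of selectively adding high-frequency image content to a coarse depth map can be sketched as follows. This is an illustration under stated assumptions, not SigNet itself: the guide is a single-channel image, high frequency is approximated as the image minus a 3x3 box blur, and the per-pixel `gate` is supplied by hand where the paper learns an implicit degradation. The names `box_blur`, `high_frequency`, and `enhance` are hypothetical.

```python
def box_blur(img):
    """3x3 mean filter; border pixels average over the neighbors that exist."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            vals = [img[rr][cc]
                    for rr in range(max(0, r - 1), min(h, r + 2))
                    for cc in range(max(0, c - 1), min(w, c + 2))]
            out[r][c] = sum(vals) / len(vals)
    return out

def high_frequency(img):
    """High-frequency residual: the image minus its blurred version."""
    blur = box_blur(img)
    return [[p - b for p, b in zip(row, brow)] for row, brow in zip(img, blur)]

def enhance(coarse, guide, gate):
    """Add gated high-frequency guide content to the coarse depth.

    Assumption: `gate` (values in [0, 1]) plays the role of the learned
    degradation that decides where image edges should be transferred.
    """
    hf = high_frequency(guide)
    return [[d + g * f for d, g, f in zip(drow, grow, frow)]
            for drow, grow, frow in zip(coarse, gate, hf)]

# A vertical step edge in the guide and a flat coarse depth.
guide = [[0.0, 0.0, 1.0, 1.0] for _ in range(3)]
coarse = [[5.0] * 4 for _ in range(3)]
gate = [[1.0] * 4 for _ in range(3)]
refined = enhance(coarse, guide, gate)
```

With the gate fully open, the refined depth changes only around the step edge; with the gate at zero everywhere, the coarse depth passes through unchanged.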
Problem

Research questions and friction points this paper is trying to address.

Image Depth Enhancement
Color Information Integration
Irregular and Ambiguous Depth Data Handling
Innovation

Methods, ideas, or system contributions that make the work stand out.

SigNet
Depth Information Enhancement
Multi-modal Conditional Mamba Algorithm
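
As a rough illustration of what "dynamically generating the state parameters" can mean in a selective state-space model, the sketch below runs a one-dimensional scan whose step size depends on a conditioning signal. It is a toy under stated assumptions, not the paper's multi-modal conditional Mamba: the state is scalar, the continuous system is fixed to dh/dt = -h + x, and `conditional_scan`, `cond`, and `w_delta` are hypothetical names.

```python
import math

def softplus(z):
    """Numerically stable softplus, used to keep the step size positive."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))

def conditional_scan(x, cond, w_delta=1.0):
    """Scalar selective scan with a condition-dependent step size.

    Zero-order-hold discretization of dh/dt = -h + x gives
        h_t = a_t * h_{t-1} + (1 - a_t) * x_t,  a_t = exp(-delta_t),
    where delta_t = softplus(w_delta * cond_t) is generated per step.
    Assumption: `cond` stands in for the degradation-derived condition.
    """
    h = 0.0
    outputs = []
    for x_t, c_t in zip(x, cond):
        delta = softplus(w_delta * c_t)  # step size from the condition
        a = math.exp(-delta)             # decay factor in (0, 1]
        h = a * h + (1.0 - a) * x_t      # state update
        outputs.append(h)
    return outputs

signal = [1.0, 2.0, 3.0]
follow = conditional_scan(signal, [50.0, 50.0, 50.0])     # large step: track input
ignore = conditional_scan(signal, [-50.0, -50.0, -50.0])  # tiny step: keep state
```

A large condition value makes the state follow the input closely, while a strongly negative one makes the state ignore it, which is the selective behavior that input-dependent parameters provide.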
Zhiqiang Yan
National University of Singapore
3D computer vision, depth perception, occupancy prediction
Zhengxue Wang
Nanjing University of Science and Technology
Depth/RGB image restoration
Kun Wang
Nanjing University of Science and Technology, China
Jun Li
Nanjing University of Science and Technology, China
Jian Yang
Nanjing University of Science and Technology, China