Mask-adaptive Gated Convolution and Bi-directional Progressive Fusion Network for Depth Completion

📅 2024-01-15
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address depth map completion in sparse depth images with missing pixels, this paper proposes an RGB-guided depth completion network. Methodologically, we design a mask-adaptive gated convolution (MagaConv) to enable missing-region-aware dynamic feature modulation, and introduce a bidirectional progressive fusion (BP-Fusion) module to jointly integrate depth and RGB features across a multi-scale feature pyramid. The network is trained end-to-end using a CNN architecture augmented with a multi-stage progressive upsampling strategy. Evaluated on NYU-Depth V2, DIML, and SUN RGB-D benchmarks, our approach achieves state-of-the-art performance, significantly outperforming existing methods in RMSE, REL, and other key metrics. It enhances completion accuracy, robustness to occlusions and sensor noise, and structural fidelity of recovered depth maps.

Technology Category

Application Category

📝 Abstract
Depth completion is a critical task for handling depth images with missing pixels, which can negatively impact further applications. Recent approaches have utilized Convolutional Neural Networks (CNNs) to reconstruct depth images with the assistance of color images. However, vanilla convolution has non-negligible drawbacks in handling missing pixels. To solve this problem, we propose a new model for depth completion based on an encoder-decoder structure. Our model introduces two key components: the Mask-adaptive Gated Convolution (MagaConv) architecture and the Bi-directional Progressive Fusion (BP-Fusion) module. The MagaConv architecture is designed to acquire precise depth features by modulating convolution operations with iteratively updated masks, while the BP-Fusion module progressively integrates depth and color features, utilizing consecutive bi-directional fusion structures in a global perspective. Extensive experiments on popular benchmarks, including NYU-Depth V2, DIML, and SUN RGB-D, demonstrate the superiority of our model over state-of-the-art methods. We achieved remarkable performance in completing depth maps and outperformed existing approaches in terms of accuracy and reliability.
Problem

Research questions and friction points this paper is trying to address.

Depth Image
Pixel Deficiency
Image Completion
Innovation

Methods, ideas, or system contributions that make the work stand out.

MagaConv
BP-Fusion
Depth Completion
🔎 Similar Papers
No similar papers found.
Tingxuan Huang
Tingxuan Huang
Northeastern University
CV
J
Jiacheng Miao
College of Information Science and Engineering, Northeastern University, China; Foshan Graduate School of Innovation, Northeastern University, China
S
Shizhuo Deng
College of Information Science and Engineering, Northeastern University, China; Foshan Graduate School of Innovation, Northeastern University, China
Tong Jia
Tong Jia
Peking University
AIOpsAnomaly DetectionLog AnalysisAI for Medical Research
D
Dongyue Chen
College of Information Science and Engineering, Northeastern University, China; Foshan Graduate School of Innovation, Northeastern University, China; National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Northeastern University, China