Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild

📅 2024-03-21

📈 Citations: 2

✨ Influential: 0

career value

174K/year

🤖 AI Summary

Zero-shot monocular 3D shape reconstruction suffers significant performance degradation in real-world scenarios due to inaccurate object segmentation and severe occlusion. To address this, we propose the first end-to-end occlusion-aware joint segmentation-reconstruction framework. Methodologically: (1) we unify segmentation and reconstruction into a single task and design a lightweight network for joint optimization; (2) we develop a scalable synthetic data pipeline that explicitly models diverse object-occluder-background configurations; (3) our model is trained exclusively on synthetic data—requiring no real 3D ground truth annotations or fine-tuning. Experiments demonstrate state-of-the-art zero-shot performance on real images, with substantially fewer parameters than existing methods. Moreover, our approach exhibits strong robustness to segmentation errors and heavy occlusion, enabling reliable reconstruction under challenging real-world conditions.

Technology Category

Application Category

📝 Abstract

Recent monocular 3D shape reconstruction methods have shown promising zero-shot results on object-segmented images without any occlusions. However, their effectiveness is significantly compromised in real-world conditions, due to imperfect object segmentation by off-the-shelf models and the prevalence of occlusions. To effectively address these issues, we propose a unified regression model that integrates segmentation and reconstruction, specifically designed for occlusion-aware 3D shape reconstruction. To facilitate its reconstruction in the wild, we also introduce a scalable data synthesis pipeline that simulates a wide range of variations in objects, occluders, and backgrounds. Training on our synthetic data enables the proposed model to achieve state-of-the-art zero-shot results on real-world images, using significantly fewer parameters than competing approaches.

Problem

Research questions and friction points this paper is trying to address.

Address imperfect object segmentation in 3D reconstruction

Handle occlusion-aware 3D shape reconstruction in real-world

Enable robust zero-shot 3D reconstruction from single images

Innovation

Methods, ideas, or system contributions that make the work stand out.

Unified model integrates segmentation and reconstruction

Scalable data synthesis simulates diverse variations

Occlusion-aware 3D shape reconstruction in wild

🔎 Similar Papers

SDFit: 3D Object Pose and Shape by Fitting a Morphable SDF to a Single Image