🤖 AI Summary
Generating high-fidelity, seamlessly consistent 360° indoor panoramas from narrow-field-of-view RGB images remains challenging. Method: We propose a bi-modal latent diffusion model (LDM) for RGB-D panorama outpainting, featuring: (1) end-to-end joint modeling of RGB and depth panoramas; (2) a progressive camera-rotation mechanism applied at each denoising step that enforces wraparound (azimuthal seam) consistency; and (3) depth-free inference, requiring only RGB input. Contribution/Results: The method outperforms state-of-the-art approaches on RGB-D panorama outpainting, generalizes across diverse mask types, and simultaneously synthesizes high-quality depth panoramas, supporting realistic reconstruction of 3D indoor scenes.
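As a rough illustration of the bi-modal latent structure mentioned above, the sketch below concatenates RGB and depth latents along the channel axis so that a single denoising network models both modalities jointly. The function name, tensor shapes, and latent channel counts are assumptions for illustration, not the paper's actual interfaces.

```python
import torch

def make_bimodal_latent(rgb_latent: torch.Tensor,
                        depth_latent: torch.Tensor) -> torch.Tensor:
    """Stack RGB and depth latents channel-wise, e.g. (B, 4, h, w) + (B, 1, h, w)
    -> (B, 5, h, w), so one U-Net denoises both modalities together."""
    return torch.cat([rgb_latent, depth_latent], dim=1)

# Training: both latents come from encoded ground-truth panoramas.
# Inference: no depth input is needed; the depth channels can be initialized
# from Gaussian noise and denoised jointly with the RGB channels.
rgb_latent = torch.randn(1, 4, 64, 128)    # placeholder encoded RGB panorama
depth_latent = torch.randn(1, 1, 64, 128)  # placeholder depth latent / noise
z = make_bimodal_latent(rgb_latent, depth_latent)  # shape (1, 5, 64, 128)
```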
📝 Abstract
Generating complete 360-degree panoramas from narrow-field-of-view images remains an open research problem, as omnidirectional RGB data is not readily available. Existing GAN-based approaches struggle to produce high-quality output and generalize poorly across different mask types. In this paper, we present a 360-degree indoor RGB-D panorama outpainting model based on latent diffusion models (LDM), called PanoDiffusion. We introduce a new bi-modal latent diffusion structure that utilizes both RGB and depth panoramic data during training, and which works surprisingly well for outpainting depth-free RGB images during inference. We further propose a novel technique that introduces progressive camera rotations during each diffusion denoising step, which substantially improves panorama wraparound consistency. Results show that our PanoDiffusion not only significantly outperforms state-of-the-art methods on RGB-D panorama outpainting, producing diverse, well-structured results for different types of masks, but can also synthesize high-quality depth panoramas to provide realistic 3D indoor models.
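To make the progressive camera-rotation idea concrete, below is a minimal sketch of rotation-augmented denoising for an equirectangular latent of shape (B, C, H, W), where the width axis spans 0 to 360 degrees of azimuth. Rolling the latent before each denoising step moves the left/right seam to a different azimuth each time, so it is repeatedly denoised as interior content. The `scheduler`/`unet` call signatures and the `rot_per_step` shift are generic placeholders under assumed interfaces, not the authors' implementation.

```python
import torch

@torch.no_grad()
def denoise_with_rotations(scheduler, unet, latents: torch.Tensor,
                           rot_per_step: int = 4) -> torch.Tensor:
    """Denoise an equirectangular latent while progressively rotating the
    camera (circularly shifting the panorama in azimuth) at every step."""
    total_shift = 0
    for t in scheduler.timesteps:
        # Rotate the camera: circular shift along the width (azimuth) axis.
        latents = torch.roll(latents, shifts=rot_per_step, dims=-1)
        total_shift += rot_per_step

        noise_pred = unet(latents, t).sample                      # predict noise
        latents = scheduler.step(noise_pred, t, latents).prev_sample

    # Undo the accumulated rotation so the output is in the original frame.
    return torch.roll(latents, shifts=-total_shift, dims=-1)
```

Because the circular shift is exactly invertible on an equirectangular grid, the accumulated rotation can be undone at the end without any resampling loss, which is what allows the seam to be treated as ordinary interior content during denoising.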