AI Summary
To address detail loss and multi-view inconsistency in high-resolution novel view synthesis from low-resolution inputs, this paper proposes VoxelGridSR, the first end-to-end, attention-driven 3D voxel super-resolution model. Operating directly on voxel grids optimized via NeRF, it models arbitrary-scale 3D super-resolution through a joint explicit voxel representation and implicit NeRF optimization, ensuring strict multi-view consistency in both geometry and appearance. Crucially, it generalizes zero-shot to unseen scenes and arbitrary scale factors without scene-specific fine-tuning. On multiple benchmarks, VoxelGridSR achieves significant PSNR and SSIM improvements over prior methods. Visually, it recovers sharp geometric structures and high-frequency textures, effectively mitigating the over-smoothing artifacts inherent in conventional NeRFs and the view-inconsistency issues of single-image super-resolution approaches.
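The arbitrary-scale property comes from the fact that a feature voxel grid can be queried at any continuous 3D point, so novel views can be sampled at whatever output resolution is desired. A minimal numpy sketch of such a continuous query via trilinear interpolation (all names and shapes here are illustrative, not the paper's actual implementation):

```python
import numpy as np

def trilinear_query(grid, pts):
    """Sample a dense feature voxel grid at continuous coordinates.

    grid: (D, H, W, C) feature volume optimized from LR views.
    pts:  (N, 3) coordinates in voxel index space, each in [0, dim - 1].
    Because any continuous point can be queried, views can be rendered
    at an arbitrary output scale. Illustrative sketch, not the paper's code.
    """
    D, H, W, C = grid.shape
    lo = np.floor(pts).astype(int)
    lo = np.minimum(lo, [D - 2, H - 2, W - 2])  # keep the +1 corner in bounds
    frac = pts - lo
    out = np.zeros((pts.shape[0], C))
    # Accumulate the 8 surrounding corners, weighted by distance along each axis.
    for dz in (0, 1):
        for dy in (0, 1):
            for dx in (0, 1):
                corner = grid[lo[:, 0] + dz, lo[:, 1] + dy, lo[:, 2] + dx]
                w = (np.where(dz, frac[:, 0], 1 - frac[:, 0])
                     * np.where(dy, frac[:, 1], 1 - frac[:, 1])
                     * np.where(dx, frac[:, 2], 1 - frac[:, 2]))
                out += w[:, None] * corner
    return out
```

With a linear feature field, the interpolation is exact; e.g. a grid whose channel stores the x index returns 0.5 at the cell center.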
Abstract
NeRF-based methods reconstruct 3D scenes by building a radiance field with implicit or explicit representations. While NeRF-based methods can perform novel view synthesis (NVS) at arbitrary scale, high-resolution novel view synthesis (HRNVS) with low-resolution (LR) optimization often results in over-smoothing. On the other hand, single-image super-resolution (SR) aims to enhance LR images into HR counterparts but lacks multi-view consistency. To address these challenges, we propose Arbitrary-Scale Super-Resolution NeRF (ASSR-NeRF), a novel framework for super-resolution novel view synthesis (SRNVS). We propose an attention-based VoxelGridSR model to directly perform 3D SR on the optimized volume. Our model is trained on diverse scenes to ensure generalizability. For unseen scenes trained with LR views, we can then directly apply our VoxelGridSR to further refine the volume and achieve multi-view consistent SR. We demonstrate quantitatively and qualitatively that the proposed method achieves significant performance gains in SRNVS.
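The abstract describes VoxelGridSR as attention-based refinement of the optimized volume. One common way to realize this is scaled dot-product attention in which a feature sampled at a query point attends over the features of its surrounding voxel corners; the sketch below illustrates that pattern only (the function name, shapes, and neighborhood size are assumptions, not the paper's architecture):

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_refine(query_feat, neighbor_feats):
    """Refine a sampled point feature by attending over voxel-neighborhood features.

    query_feat:     (d,)   feature interpolated at an arbitrary-scale sample point.
    neighbor_feats: (k, d) features of the k surrounding voxel corners.
    Returns a refined (d,) feature. Hypothetical sketch of attention-based
    voxel refinement, not the authors' actual VoxelGridSR model.
    """
    d = query_feat.shape[-1]
    scores = neighbor_feats @ query_feat / np.sqrt(d)  # (k,) scaled dot products
    weights = softmax(scores)                          # attention weights, sum to 1
    return weights @ neighbor_feats                    # weighted aggregation
```

Because the weights form a convex combination, the refined feature stays within the span of the local voxel features, which is one way such a design can preserve consistency across views that sample the same 3D location.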