ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction

๐Ÿ“… 2024-06-28
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 2
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
To address detail loss and multi-view inconsistency in high-resolution novel view synthesis from low-resolution inputs, this paper proposes VoxelGridSRโ€”the first end-to-end, attention-driven 3D voxel super-resolution framework. Operating directly on voxel grids optimized via NeRF, it models arbitrary-scale 3D super-resolution through joint explicit voxel representation and implicit NeRF optimization, ensuring strict multi-view consistency in both geometry and appearance. Crucially, it generalizes zero-shot to unseen scenes and arbitrary scale factors without scene-specific fine-tuning. On multiple benchmarks, VoxelGridSR achieves significant PSNR and SSIM improvements over prior methods. Visually, it recovers sharp geometric structures and high-frequency textures, effectively mitigating the over-smoothing artifacts inherent in conventional NeRFs and the view-inconsistency issues of single-image super-resolution approaches.

Technology Category

Application Category

๐Ÿ“ Abstract
NeRF-based methods reconstruct 3D scenes by building a radiance field with implicit or explicit representations. While NeRF-based methods can perform novel view synthesis (NVS) at arbitrary scale, the performance in high-resolution novel view synthesis (HRNVS) with low-resolution (LR) optimization often results in oversmoothing. On the other hand, single-image super-resolution (SR) aims to enhance LR images to HR counterparts but lacks multi-view consistency. To address these challenges, we propose Arbitrary-Scale Super-Resolution NeRF (ASSR-NeRF), a novel framework for super-resolution novel view synthesis (SRNVS). We propose an attention-based VoxelGridSR model to directly perform 3D super-resolution (SR) on the optimized volume. Our model is trained on diverse scenes to ensure generalizability. For unseen scenes trained with LR views, we then can directly apply our VoxelGridSR to further refine the volume and achieve multi-view consistent SR. We demonstrate quantitative and qualitatively that the proposed method achieves significant performance in SRNVS.
Problem

Research questions and friction points this paper is trying to address.

Achieves multi-view consistent super-resolution for NeRF scenes
Enhances low-resolution optimized volumes to high-resolution without oversmoothing
Enables arbitrary-scale super-resolution on voxel grids for novel view synthesis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Attention-based VoxelGridSR model for 3D super-resolution
Direct 3D super-resolution on optimized volume
Multi-view consistent super-resolution for unseen scenes
๐Ÿ”Ž Similar Papers
No similar papers found.