Abstract
Background
Novel view synthesis has grown significantly in popularity recently thanks to the introduction of implicit 3D scene represenations such as NeRF. These techniques enable many downstream applications in fields ranging from robotics to entertainment. However, most of these techniques are limited by the number of training views required: many need up to 200 views which can be prohibitively high for real-world usage. Prior work has tackled this problem with sparse view techniques, but these almost entirely focus on forward-facing scenes and suffer from long training and inference times of NeRFs.
Objective
We introduce a technique for real-time 360 sparse view synthesis by leveraging 3D Gaussian Splatting. The explicit nature of our scene representations allows to reduce sparse view artifacts with techniques that directly operate on the scene representation in an adaptive manner. Combined with depth based constraints, we are able to render high-quality novel views and depth maps for unbounded scenes.
Key Ideas
- Leverage the explicit gaussian representation to directly remove unwanted sparse view artifacts such as “floaters” and “background collapse”
- Use off-the-shelf depth estimation models to regularize novel view outputs
- Reconstruct regions with low coverage in training views with diffusion-model guidance
Floater Removal Example
More Pictures
More Videos (Updating)
Table: Some sparse-view NeRF baseline comparisons on the MipNeRF360 dataset
Model | PSNR ↑ | SSIM ↑ | LPIPS ↓ | Runtime* (h) | Render FPS |
---|---|---|---|---|---|
SparseNeRF | 11.5638 | 0.3206 | 0.6984 | 4 | 1/120 |
Base 3DGS | 15.3840 | 0.4415 | 0.5061 | 0.5 | 30 |
Mip-NeRF 360 | 17.1044 | 0.4660 | 0.5750 | 3 | 1/120 |
RegNeRF | 11.7379 | 0.2266 | 0.6892 | 4 | 1/120 |
ViP-NeRF | 11.1622 | 0.2291 | 0.7132 | 4 | 1/120 |
Ours | 16.6898 | 0.4899 | 0.4849 | 0.75 | 30 |
We use 12 images for each scene. * Runtimes are recorded on one RTX3090.
Citation
@article{xiong2023sparsegs,
author = {Xiong, Haolin and Muttukuru, Sairisheek and Upadhyay, Rishi and Chari, Pradyumna and Kadambi, Achuta},
title = {SparseGS: Real-Time 360° Sparse View Synthesis using Gaussian Splatting},
journal = {Arxiv},
year = {2023},
}