DaRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation

Song, Jiuhn; Park, Seonghoon; An, Honggyu; Cho, Seokju; Kwak, Min-Seop; Cho, Sungjin; Kim, Seungryong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.19201 (cs)

[Submitted on 30 May 2023 (v1), last revised 25 Sep 2023 (this version, v2)]

Title:DaRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation

Authors:Jiuhn Song, Seonghoon Park, Honggyu An, Seokju Cho, Min-Seop Kwak, Sungjin Cho, Seungryong Kim

View PDF

Abstract:Neural radiance fields (NeRF) shows powerful performance in novel view synthesis and 3D geometry reconstruction, but it suffers from critical performance degradation when the number of known viewpoints is drastically reduced. Existing works attempt to overcome this problem by employing external priors, but their success is limited to certain types of scenes or datasets. Employing monocular depth estimation (MDE) networks, pretrained on large-scale RGB-D datasets, with powerful generalization capability would be a key to solving this problem: however, using MDE in conjunction with NeRF comes with a new set of challenges due to various ambiguity problems exhibited by monocular depths. In this light, we propose a novel framework, dubbed DäRF, that achieves robust NeRF reconstruction with a handful of real-world images by combining the strengths of NeRF and monocular depth estimation through online complementary training. Our framework imposes the MDE network's powerful geometry prior to NeRF representation at both seen and unseen viewpoints to enhance its robustness and coherence. In addition, we overcome the ambiguity problems of monocular depths through patch-wise scale-shift fitting and geometry distillation, which adapts the MDE network to produce depths aligned accurately with NeRF geometry. Experiments show our framework achieves state-of-the-art results both quantitatively and qualitatively, demonstrating consistent and reliable performance in both indoor and outdoor real-world datasets. Project page is available at this https URL.

Comments:	To appear at NeurIPS 2023. Project Page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.19201 [cs.CV]
	(or arXiv:2305.19201v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.19201

Submission history

From: Jiuhn Song [view email]
[v1] Tue, 30 May 2023 16:46:41 UTC (40,259 KB)
[v2] Mon, 25 Sep 2023 15:56:58 UTC (47,839 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DaRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DaRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators