DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo Networks

We propose an enhancement module called depth discontinuity learning (DDL) for learning-based multi-view stereo (MVS) methods. Traditional methods are known for their accuracy but struggle with completeness. While recent learning-based methods have improved completeness at the cost of accuracy, our...

Full description

Bibliographic Details
Main Authors: Nail Ibrahimli, Hugo Ledoux, Julian F. P. Kooij, Liangliang Nan
Format: Article
Language:English
Published: MDPI AG 2023-06-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/15/12/2970
_version_ 1797592845050183680
author Nail Ibrahimli
Hugo Ledoux
Julian F. P. Kooij
Liangliang Nan
author_facet Nail Ibrahimli
Hugo Ledoux
Julian F. P. Kooij
Liangliang Nan
author_sort Nail Ibrahimli
collection DOAJ
description We propose an enhancement module called depth discontinuity learning (DDL) for learning-based multi-view stereo (MVS) methods. Traditional methods are known for their accuracy but struggle with completeness. While recent learning-based methods have improved completeness at the cost of accuracy, our DDL approach aims to improve accuracy while retaining completeness in the reconstruction process. To achieve this, we introduce the joint estimation of depth and boundary maps, where the boundary maps are explicitly utilized for further refinement of the depth maps. We validate our idea by integrating it into an existing learning-based MVS pipeline where the reconstruction depends on high-quality depth map estimation. Extensive experiments on various datasets, namely DTU, ETH3D, “Tanks and Temples”, and BlendedMVS, show that our method improves reconstruction quality compared to our baseline, Patchmatchnet. Our ablation study demonstrates that incorporating the proposed DDL significantly reduces the depth map error, for instance, by more than 30% on the DTU dataset, and leads to improved depth map quality in both smooth and boundary regions. Additionally, our qualitative analysis has shown that the reconstructed point cloud exhibits enhanced quality without any significant compromise on completeness. Finally, the experiments reveal that our proposed model and strategies exhibit strong generalization capabilities across the various datasets.
first_indexed 2024-03-11T01:58:53Z
format Article
id doaj.art-6363a1e1ccef4f8289b528048b35e467
institution Directory Open Access Journal
issn 2072-4292
language English
last_indexed 2024-03-11T01:58:53Z
publishDate 2023-06-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj.art-6363a1e1ccef4f8289b528048b35e4672023-11-18T12:24:29ZengMDPI AGRemote Sensing2072-42922023-06-011512297010.3390/rs15122970DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo NetworksNail Ibrahimli0Hugo Ledoux1Julian F. P. Kooij2Liangliang Nan33D Geoinformation, Faculty of Architecture and the Built Environment, Delft University of Technology, 2628 BL Delft, The Netherlands3D Geoinformation, Faculty of Architecture and the Built Environment, Delft University of Technology, 2628 BL Delft, The NetherlandsIntelligent Vehicles Group, Delft University of Technology, 2628 BL Delft, The Netherlands3D Geoinformation, Faculty of Architecture and the Built Environment, Delft University of Technology, 2628 BL Delft, The NetherlandsWe propose an enhancement module called depth discontinuity learning (DDL) for learning-based multi-view stereo (MVS) methods. Traditional methods are known for their accuracy but struggle with completeness. While recent learning-based methods have improved completeness at the cost of accuracy, our DDL approach aims to improve accuracy while retaining completeness in the reconstruction process. To achieve this, we introduce the joint estimation of depth and boundary maps, where the boundary maps are explicitly utilized for further refinement of the depth maps. We validate our idea by integrating it into an existing learning-based MVS pipeline where the reconstruction depends on high-quality depth map estimation. Extensive experiments on various datasets, namely DTU, ETH3D, “Tanks and Temples”, and BlendedMVS, show that our method improves reconstruction quality compared to our baseline, Patchmatchnet. Our ablation study demonstrates that incorporating the proposed DDL significantly reduces the depth map error, for instance, by more than 30% on the DTU dataset, and leads to improved depth map quality in both smooth and boundary regions. Additionally, our qualitative analysis has shown that the reconstructed point cloud exhibits enhanced quality without any significant compromise on completeness. Finally, the experiments reveal that our proposed model and strategies exhibit strong generalization capabilities across the various datasets.https://www.mdpi.com/2072-4292/15/12/2970multi-view stereo3D reconstructiondepth map refinementdepth boundary estimation
spellingShingle Nail Ibrahimli
Hugo Ledoux
Julian F. P. Kooij
Liangliang Nan
DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo Networks
Remote Sensing
multi-view stereo
3D reconstruction
depth map refinement
depth boundary estimation
title DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo Networks
title_full DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo Networks
title_fullStr DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo Networks
title_full_unstemmed DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo Networks
title_short DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo Networks
title_sort ddl mvs depth discontinuity learning for multi view stereo networks
topic multi-view stereo
3D reconstruction
depth map refinement
depth boundary estimation
url https://www.mdpi.com/2072-4292/15/12/2970
work_keys_str_mv AT nailibrahimli ddlmvsdepthdiscontinuitylearningformultiviewstereonetworks
AT hugoledoux ddlmvsdepthdiscontinuitylearningformultiviewstereonetworks
AT julianfpkooij ddlmvsdepthdiscontinuitylearningformultiviewstereonetworks
AT liangliangnan ddlmvsdepthdiscontinuitylearningformultiviewstereonetworks