DRI-MVSNet: A depth residual inference network for multi-view stereo images.

Three-dimensional (3D) image reconstruction is an important field of computer vision for restoring the 3D geometry of a given scene. Due to the demand for large amounts of memory, prevalent methods of 3D reconstruction yield inaccurate results, because of which the highly accuracy reconstruction of...

Full description

Bibliographic Details
Main Authors:	Ying Li, Wenyue Li, Zhijie Zhao, JiaHao Fan
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2022-01-01
Series:	PLoS ONE
Online Access:	https://doi.org/10.1371/journal.pone.0264721

_version_	1819158919254114304
author	Ying Li Wenyue Li Zhijie Zhao JiaHao Fan
author_facet	Ying Li Wenyue Li Zhijie Zhao JiaHao Fan
author_sort	Ying Li
collection	DOAJ
description	Three-dimensional (3D) image reconstruction is an important field of computer vision for restoring the 3D geometry of a given scene. Due to the demand for large amounts of memory, prevalent methods of 3D reconstruction yield inaccurate results, because of which the highly accuracy reconstruction of a scene remains an outstanding challenge. This study proposes a cascaded depth residual inference network, called DRI-MVSNet, that uses a cross-view similarity-based feature map fusion module for residual inference. It involves three improvements. First, a combined module is used for processing channel-related and spatial information to capture the relevant contextual information and improve feature representation. It combines the channel attention mechanism and spatial pooling networks. Second, a cross-view similarity-based feature map fusion module is proposed that learns the similarity between pairs of pixel in each source and reference image at planes of different depths along the frustum of the reference camera. Third, a deep, multi-stage residual prediction module is designed to generate a high-precision depth map that uses a non-uniform depth sampling strategy to construct hypothetical depth planes. The results of extensive experiments show that DRI-MVSNet delivers competitive performance on the DTU and the Tanks & Temples datasets, and the accuracy and completeness of the point cloud reconstructed by it are significantly superior to those of state-of-the-art benchmarks.
first_indexed	2024-12-22T16:32:18Z
format	Article
id	doaj.art-e5ad4fcabfed4f98bbb3adc775357ffd
institution	Directory Open Access Journal
issn	1932-6203
language	English
last_indexed	2024-12-22T16:32:18Z
publishDate	2022-01-01
publisher	Public Library of Science (PLoS)
record_format	Article
series	PLoS ONE
spelling	doaj.art-e5ad4fcabfed4f98bbb3adc775357ffd2022-12-21T18:20:01ZengPublic Library of Science (PLoS)PLoS ONE1932-62032022-01-01173e026472110.1371/journal.pone.0264721DRI-MVSNet: A depth residual inference network for multi-view stereo images.Ying LiWenyue LiZhijie ZhaoJiaHao FanThree-dimensional (3D) image reconstruction is an important field of computer vision for restoring the 3D geometry of a given scene. Due to the demand for large amounts of memory, prevalent methods of 3D reconstruction yield inaccurate results, because of which the highly accuracy reconstruction of a scene remains an outstanding challenge. This study proposes a cascaded depth residual inference network, called DRI-MVSNet, that uses a cross-view similarity-based feature map fusion module for residual inference. It involves three improvements. First, a combined module is used for processing channel-related and spatial information to capture the relevant contextual information and improve feature representation. It combines the channel attention mechanism and spatial pooling networks. Second, a cross-view similarity-based feature map fusion module is proposed that learns the similarity between pairs of pixel in each source and reference image at planes of different depths along the frustum of the reference camera. Third, a deep, multi-stage residual prediction module is designed to generate a high-precision depth map that uses a non-uniform depth sampling strategy to construct hypothetical depth planes. The results of extensive experiments show that DRI-MVSNet delivers competitive performance on the DTU and the Tanks & Temples datasets, and the accuracy and completeness of the point cloud reconstructed by it are significantly superior to those of state-of-the-art benchmarks.https://doi.org/10.1371/journal.pone.0264721
spellingShingle	Ying Li Wenyue Li Zhijie Zhao JiaHao Fan DRI-MVSNet: A depth residual inference network for multi-view stereo images. PLoS ONE
title	DRI-MVSNet: A depth residual inference network for multi-view stereo images.
title_full	DRI-MVSNet: A depth residual inference network for multi-view stereo images.
title_fullStr	DRI-MVSNet: A depth residual inference network for multi-view stereo images.
title_full_unstemmed	DRI-MVSNet: A depth residual inference network for multi-view stereo images.
title_short	DRI-MVSNet: A depth residual inference network for multi-view stereo images.
title_sort	dri mvsnet a depth residual inference network for multi view stereo images
url	https://doi.org/10.1371/journal.pone.0264721
work_keys_str_mv	AT yingli drimvsnetadepthresidualinferencenetworkformultiviewstereoimages AT wenyueli drimvsnetadepthresidualinferencenetworkformultiviewstereoimages AT zhijiezhao drimvsnetadepthresidualinferencenetworkformultiviewstereoimages AT jiahaofan drimvsnetadepthresidualinferencenetworkformultiviewstereoimages

DRI-MVSNet: A depth residual inference network for multi-view stereo images.

Similar Items