Discriminative feature encoding for intrinsic image decomposition

Abstract Intrinsic image decomposition is an important and long-standing computer vision problem. Given an input image, recovering the physical scene properties is ill-posed. Several physically motivated priors have been used to restrict the solution space of the optimization problem for intrinsic i...

Full description

Bibliographic Details
Main Authors:	Zongji Wang, Yunfei Liu, Feng Lu
Format:	Article
Language:	English
Published:	SpringerOpen 2023-04-01
Series:	Computational Visual Media
Subjects:	intrinsic image decomposition deep learning feature distribution data refinement
Online Access:	https://doi.org/10.1007/s41095-022-0294-4

_version_	1797822711546773504
author	Zongji Wang Yunfei Liu Feng Lu
author_facet	Zongji Wang Yunfei Liu Feng Lu
author_sort	Zongji Wang
collection	DOAJ
description	Abstract Intrinsic image decomposition is an important and long-standing computer vision problem. Given an input image, recovering the physical scene properties is ill-posed. Several physically motivated priors have been used to restrict the solution space of the optimization problem for intrinsic image decomposition. This work takes advantage of deep learning, and shows that it can solve this challenging computer vision problem with high efficiency. The focus lies in the feature encoding phase to extract discriminative features for different intrinsic layers from an input image. To achieve this goal, we explore the distinctive characteristics of different intrinsic components in the high-dimensional feature embedding space. We define feature distribution divergence to efficiently separate the feature vectors of different intrinsic components. The feature distributions are also constrained to fit the real ones through a feature distribution consistency. In addition, a data refinement approach is provided to remove data inconsistency from the Sintel dataset, making it more suitable for intrinsic image decomposition. Our method is also extended to intrinsic video decomposition based on pixel-wise correspondences between adjacent frames. Experimental results indicate that our proposed network structure can outperform the existing state-of-the-art.
first_indexed	2024-03-13T10:13:10Z
format	Article
id	doaj.art-01da38f739cb4f7bb5b4e407762fbfd9
institution	Directory Open Access Journal
issn	2096-0433 2096-0662
language	English
last_indexed	2024-03-13T10:13:10Z
publishDate	2023-04-01
publisher	SpringerOpen
record_format	Article
series	Computational Visual Media
spelling	doaj.art-01da38f739cb4f7bb5b4e407762fbfd92023-05-21T11:23:06ZengSpringerOpenComputational Visual Media2096-04332096-06622023-04-019359761810.1007/s41095-022-0294-4Discriminative feature encoding for intrinsic image decompositionZongji Wang0Yunfei Liu1Feng Lu2Key Laboratory of Network Information System Technology (NIST), Aerospace Information Research Institute, Chinese Academy of SciencesState Key Laboratory of Virtual Reality Technology and Systems, School of Computer Science and Engineering, Beihang UniversityState Key Laboratory of Virtual Reality Technology and Systems, School of Computer Science and Engineering, Beihang UniversityAbstract Intrinsic image decomposition is an important and long-standing computer vision problem. Given an input image, recovering the physical scene properties is ill-posed. Several physically motivated priors have been used to restrict the solution space of the optimization problem for intrinsic image decomposition. This work takes advantage of deep learning, and shows that it can solve this challenging computer vision problem with high efficiency. The focus lies in the feature encoding phase to extract discriminative features for different intrinsic layers from an input image. To achieve this goal, we explore the distinctive characteristics of different intrinsic components in the high-dimensional feature embedding space. We define feature distribution divergence to efficiently separate the feature vectors of different intrinsic components. The feature distributions are also constrained to fit the real ones through a feature distribution consistency. In addition, a data refinement approach is provided to remove data inconsistency from the Sintel dataset, making it more suitable for intrinsic image decomposition. Our method is also extended to intrinsic video decomposition based on pixel-wise correspondences between adjacent frames. Experimental results indicate that our proposed network structure can outperform the existing state-of-the-art.https://doi.org/10.1007/s41095-022-0294-4intrinsic image decompositiondeep learningfeature distributiondata refinement
spellingShingle	Zongji Wang Yunfei Liu Feng Lu Discriminative feature encoding for intrinsic image decomposition Computational Visual Media intrinsic image decomposition deep learning feature distribution data refinement
title	Discriminative feature encoding for intrinsic image decomposition
title_full	Discriminative feature encoding for intrinsic image decomposition
title_fullStr	Discriminative feature encoding for intrinsic image decomposition
title_full_unstemmed	Discriminative feature encoding for intrinsic image decomposition
title_short	Discriminative feature encoding for intrinsic image decomposition
title_sort	discriminative feature encoding for intrinsic image decomposition
topic	intrinsic image decomposition deep learning feature distribution data refinement
url	https://doi.org/10.1007/s41095-022-0294-4
work_keys_str_mv	AT zongjiwang discriminativefeatureencodingforintrinsicimagedecomposition AT yunfeiliu discriminativefeatureencodingforintrinsicimagedecomposition AT fenglu discriminativefeatureencodingforintrinsicimagedecomposition

Discriminative feature encoding for intrinsic image decomposition

Similar Items