Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection

RGB-D saliency detection aims to accurately localize salient regions using the complementary information of a depth map. Global contexts carried by the deep layer are key to salient objection detection, but they are diluted when transferred to shallower layers. Besides, depth maps may contain mislea...

ver descrição completa

Detalhes bibliográficos
Principais autores:	Shuaihui Wang, Fengyi Jiang, Boqian Xu
Formato:	Artigo
Idioma:	English
Publicado em:	MDPI AG 2023-08-01
coleção:	Sensors
Assuntos:	RGB-D salient object detection global guidance cross-modal cross-scale fusion
Acesso em linha:	https://www.mdpi.com/1424-8220/23/16/7221

_version_	1827728695833395200
author	Shuaihui Wang Fengyi Jiang Boqian Xu
author_facet	Shuaihui Wang Fengyi Jiang Boqian Xu
author_sort	Shuaihui Wang
collection	DOAJ
description	RGB-D saliency detection aims to accurately localize salient regions using the complementary information of a depth map. Global contexts carried by the deep layer are key to salient objection detection, but they are diluted when transferred to shallower layers. Besides, depth maps may contain misleading information due to the depth sensors. To tackle these issues, in this paper, we propose a new cross-modal cross-scale network for RGB-D salient object detection, where the global context information provides global guidance to boost performance in complex scenarios. First, we introduce a global guided cross-modal and cross-scale module named G<sup>2</sup>CMCSM to realize global guided cross-modal cross-scale fusion. Then, we employ feature refinement modules for progressive refinement in a coarse-to-fine manner. In addition, we adopt a hybrid loss function to supervise the training of G<sup>2</sup>CMCSNet over different scales. With all these modules working together, G<sup>2</sup>CMCSNet effectively enhances both salient object details and salient object localization. Extensive experiments on challenging benchmark datasets demonstrate that our G<sup>2</sup>CMCSNet outperforms existing state-of-the-art methods.
first_indexed	2024-03-10T23:35:38Z
format	Article
id	doaj.art-6b8a3b709164402aa1e68cbf099bab72
institution	Directory Open Access Journal
issn	1424-8220
language	English
last_indexed	2024-03-10T23:35:38Z
publishDate	2023-08-01
publisher	MDPI AG
record_format	Article
series	Sensors
spelling	doaj.art-6b8a3b709164402aa1e68cbf099bab722023-11-19T02:58:31ZengMDPI AGSensors1424-82202023-08-012316722110.3390/s23167221Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object DetectionShuaihui Wang0Fengyi Jiang1Boqian Xu2Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaChangchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, ChinaRGB-D saliency detection aims to accurately localize salient regions using the complementary information of a depth map. Global contexts carried by the deep layer are key to salient objection detection, but they are diluted when transferred to shallower layers. Besides, depth maps may contain misleading information due to the depth sensors. To tackle these issues, in this paper, we propose a new cross-modal cross-scale network for RGB-D salient object detection, where the global context information provides global guidance to boost performance in complex scenarios. First, we introduce a global guided cross-modal and cross-scale module named G<sup>2</sup>CMCSM to realize global guided cross-modal cross-scale fusion. Then, we employ feature refinement modules for progressive refinement in a coarse-to-fine manner. In addition, we adopt a hybrid loss function to supervise the training of G<sup>2</sup>CMCSNet over different scales. With all these modules working together, G<sup>2</sup>CMCSNet effectively enhances both salient object details and salient object localization. Extensive experiments on challenging benchmark datasets demonstrate that our G<sup>2</sup>CMCSNet outperforms existing state-of-the-art methods.https://www.mdpi.com/1424-8220/23/16/7221RGB-D salient object detectionglobal guidancecross-modal cross-scale fusion
spellingShingle	Shuaihui Wang Fengyi Jiang Boqian Xu Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection Sensors RGB-D salient object detection global guidance cross-modal cross-scale fusion
title	Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection
title_full	Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection
title_fullStr	Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection
title_full_unstemmed	Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection
title_short	Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection
title_sort	global guided cross modal cross scale network for rgb d salient object detection
topic	RGB-D salient object detection global guidance cross-modal cross-scale fusion
url	https://www.mdpi.com/1424-8220/23/16/7221
work_keys_str_mv	AT shuaihuiwang globalguidedcrossmodalcrossscalenetworkforrgbdsalientobjectdetection AT fengyijiang globalguidedcrossmodalcrossscalenetworkforrgbdsalientobjectdetection AT boqianxu globalguidedcrossmodalcrossscalenetworkforrgbdsalientobjectdetection

Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection

Registros relacionados