Deep robust multilevel semantic hashing for multi-label cross-modal retrieval

Hashing based cross-modal retrieval has recently made significant progress. But straightforward embedding data from different modalities involving rich semantics into a joint Hamming space will inevitably produce false codes due to the intrinsic modality discrepancy and noises. We present a novel d...

Full description

Bibliographic Details
Main Authors:	Song, Ge, Tan, Xiaoyang, Zhao, Jun, Yang, Ming
Other Authors:	School of Computer Science and Engineering
Format:	Journal Article
Language:	English
Published:	2023
Subjects:	Engineering::Computer science and engineering Hashing Multi-Label
Online Access:	https://hdl.handle.net/10356/164098

_version_	1824456267577425920
author	Song, Ge Tan, Xiaoyang Zhao, Jun Yang, Ming
author2	School of Computer Science and Engineering
author_facet	School of Computer Science and Engineering Song, Ge Tan, Xiaoyang Zhao, Jun Yang, Ming
author_sort	Song, Ge
collection	NTU
description	Hashing based cross-modal retrieval has recently made significant progress. But straightforward embedding data from different modalities involving rich semantics into a joint Hamming space will inevitably produce false codes due to the intrinsic modality discrepancy and noises. We present a novel deep Robust Multilevel Semantic Hashing (RMSH) for more accurate multi-label cross-modal retrieval. It seeks to preserve fine-grained similarity among data with rich semantics,i.e., multi-label, while explicitly require distances between dissimilar points to be larger than a specific value for strong robustness. For this, we give an effective bound of this value based on the information coding-theoretic analysis, and the above goals are embodied into a margin-adaptive triplet loss. Furthermore, we introduce pseudo-codes via fusing multiple hash codes to explore seldom-seen semantics, alleviating the sparsity problem of similarity information. Experiments on three benchmarks show the validity of the derived bounds, and our method achieves state-of-the-art performance.
first_indexed	2025-02-19T03:51:23Z
format	Journal Article
id	ntu-10356/164098
institution	Nanyang Technological University
language	English
last_indexed	2025-02-19T03:51:23Z
publishDate	2023
record_format	dspace
spelling	ntu-10356/1640982023-01-04T08:31:38Z Deep robust multilevel semantic hashing for multi-label cross-modal retrieval Song, Ge Tan, Xiaoyang Zhao, Jun Yang, Ming School of Computer Science and Engineering Engineering::Computer science and engineering Hashing Multi-Label Hashing based cross-modal retrieval has recently made significant progress. But straightforward embedding data from different modalities involving rich semantics into a joint Hamming space will inevitably produce false codes due to the intrinsic modality discrepancy and noises. We present a novel deep Robust Multilevel Semantic Hashing (RMSH) for more accurate multi-label cross-modal retrieval. It seeks to preserve fine-grained similarity among data with rich semantics,i.e., multi-label, while explicitly require distances between dissimilar points to be larger than a specific value for strong robustness. For this, we give an effective bound of this value based on the information coding-theoretic analysis, and the above goals are embodied into a margin-adaptive triplet loss. Furthermore, we introduce pseudo-codes via fusing multiple hash codes to explore seldom-seen semantics, alleviating the sparsity problem of similarity information. Experiments on three benchmarks show the validity of the derived bounds, and our method achieves state-of-the-art performance. This work is partially supported by National Science Foundation of China (61976115, 61732006, 61876087), Natural Science Foundation of Jiangsu Province (SBK2021043459), AI+ Project of NUAA (NZ2020012,56XZA18009), research project (315025305), and China Scholarship Council (201906830057). 2023-01-04T08:31:38Z 2023-01-04T08:31:38Z 2021 Journal Article Song, G., Tan, X., Zhao, J. & Yang, M. (2021). Deep robust multilevel semantic hashing for multi-label cross-modal retrieval. Pattern Recognition, 120, 108084-. https://dx.doi.org/10.1016/j.patcog.2021.108084 0031-3203 https://hdl.handle.net/10356/164098 10.1016/j.patcog.2021.108084 120 108084 en Pattern Recognition © 2021 Elsevier Ltd. All rights reserved.
spellingShingle	Engineering::Computer science and engineering Hashing Multi-Label Song, Ge Tan, Xiaoyang Zhao, Jun Yang, Ming Deep robust multilevel semantic hashing for multi-label cross-modal retrieval
title	Deep robust multilevel semantic hashing for multi-label cross-modal retrieval
title_full	Deep robust multilevel semantic hashing for multi-label cross-modal retrieval
title_fullStr	Deep robust multilevel semantic hashing for multi-label cross-modal retrieval
title_full_unstemmed	Deep robust multilevel semantic hashing for multi-label cross-modal retrieval
title_short	Deep robust multilevel semantic hashing for multi-label cross-modal retrieval
title_sort	deep robust multilevel semantic hashing for multi label cross modal retrieval
topic	Engineering::Computer science and engineering Hashing Multi-Label
url	https://hdl.handle.net/10356/164098
work_keys_str_mv	AT songge deeprobustmultilevelsemantichashingformultilabelcrossmodalretrieval AT tanxiaoyang deeprobustmultilevelsemantichashingformultilabelcrossmodalretrieval AT zhaojun deeprobustmultilevelsemantichashingformultilabelcrossmodalretrieval AT yangming deeprobustmultilevelsemantichashingformultilabelcrossmodalretrieval

Deep robust multilevel semantic hashing for multi-label cross-modal retrieval

Similar Items