Learning Adequate Alignment and Interaction for Cross-Modal Retrieval
Cross-modal retrieval has attracted widespread attention in many cross-media similarity search applications, especially image-text retrieval in the fields of computer vision and natural language processing. Recently, visual and semantic embedding (VSE) learning has shown promising improvements on im...
Main Authors: | MingKang Wang, Min Meng, Jigang Liu, Jigang Wu |
---|---|
Format: | Article |
Language: | English |
Published: |
KeAi Communications Co., Ltd.
2023-12-01
|
Series: | Virtual Reality & Intelligent Hardware |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S209657962300027X |
Similar Items
-
Text-Image Cross-modal Retrieval Based on Transformer
by: YANG Xiaoyu, LI Chao, CHEN Shunyao, LI Haoliang, YIN Guangqiang
Published: (2023-04-01) -
Cross modal recipe retrieval with fine grained modal interaction
by: Fan Zhao, et al.
Published: (2025-02-01) -
Deep Semantic Cross Modal Hashing Based on Graph Similarity of Modal-Specific
by: Junzheng Li
Published: (2021-01-01) -
Cross-modal retrieval based on multi-dimensional feature fusion hashing
by: Dongxiao Ren, et al.
Published: (2024-06-01) -
On the Limitations of Visual-Semantic Embedding Networks for Image-to-Text Information Retrieval
by: Yan Gong, et al.
Published: (2021-07-01)