Learning Adequate Alignment and Interaction for Cross-Modal Retrieval

Cross-modal retrieval has attracted widespread attention in many cross-media similarity search applications, especially image-text retrieval in the fields of computer vision and natural language processing. Recently, visual and semantic embedding (VSE) learning has shown promising improvements on im...

Bibliographic Details
Main Authors: MingKang Wang, Min Meng, Jigang Liu, Jigang Wu
Format: Article
Language: English
Published: KeAi Communications Co., Ltd. 2023-12-01
Series: Virtual Reality & Intelligent Hardware
Online Access: http://www.sciencedirect.com/science/article/pii/S209657962300027X