A Fusion Encoder with Multi-Task Guidance for Cross-Modal Text–Image Retrieval in Remote Sensing
In recent years, there has been a growing interest in remote sensing image–text cross-modal retrieval due to the rapid development of space information technology and the significant increase in the volume of remote sensing image data. Remote sensing images have unique characteristics that make the...
Main Authors: | Xiong Zhang, Weipeng Li, Xu Wang, Luyao Wang, Fuzhong Zheng, Long Wang, Haisu Zhang |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG 2023-09-01 |
Series: | Remote Sensing |
Online Access: | https://www.mdpi.com/2072-4292/15/18/4637 |
Similar Items
- A Cross-Attention Mechanism Based on Regional-Level Semantic Features of Images for Cross-Modal Text-Image Retrieval in Remote Sensing
  by: Fuzhong Zheng, et al.
  Published: (2022-11-01)
- A Fine-Grained Semantic Alignment Method Specific to Aggregate Multi-Scale Information for Cross-Modal Remote Sensing Image Retrieval
  by: Fuzhong Zheng, et al.
  Published: (2023-10-01)
- Exploring latent weight factors and global information for food-oriented cross-modal retrieval
  by: Wenyu Zhao, et al.
  Published: (2023-12-01)
- The State of the Art for Cross-Modal Retrieval: A Survey
  by: Kun Zhou, et al.
  Published: (2023-01-01)
- Deep Self-Supervised Hashing With Fine-Grained Similarity Mining for Cross-Modal Retrieval
  by: Lijun Han, et al.
  Published: (2024-01-01)