A Survey of Vision and Language Related Multi-Modal Task

With the significant breakthrough in the research of single-modal related deep learning tasks, more and more works begin to focus on multi-modal tasks. Multi-modal tasks usually involve more than one different modalities, and a modality represents a type of behavior or state. Common multi-modal info...

Full description

Bibliographic Details
Main Authors: Lanxiao Wang, Wenzhe Hu, Heqian Qiu, Chao Shang, Taijin Zhao, Benliu Qiu, King Ngi Ngan, Hongliang Li
Format: Article
Language:English
Published: Tsinghua University Press 2022-12-01
Series:CAAI Artificial Intelligence Research
Subjects:
Online Access:https://www.sciopen.com/article/10.26599/AIR.2022.9150008