Deep Vision Multimodal Learning: Methodology, Benchmark, and Trend

Deep vision multimodal learning aims at combining deep visual representation learning with other modalities, such as text, sound, and data collected from other sensors. With the fast development of deep learning, vision multimodal learning has gained much interest from the community. This paper revi...

Full description

Bibliographic Details
Main Authors: Wenhao Chai, Gaoang Wang
Format: Article
Language:English
Published: MDPI AG 2022-06-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/12/13/6588