Deep Vision Multimodal Learning: Methodology, Benchmark, and Trend

Deep vision multimodal learning aims at combining deep visual representation learning with other modalities, such as text, sound, and data collected from other sensors. With the fast development of deep learning, vision multimodal learning has gained much interest from the community. This paper revi...

Повний опис

Бібліографічні деталі
Автори: Wenhao Chai, Gaoang Wang
Формат: Стаття
Мова:English
Опубліковано: MDPI AG 2022-06-01
Серія:Applied Sciences
Предмети:
Онлайн доступ:https://www.mdpi.com/2076-3417/12/13/6588