Image–text coherence and its implications for multimodal AI

Human communication often combines imagery and text into integrated presentations, especially online. In this paper, we show how image–text coherence relations can be used to model the pragmatics of image–text presentations in AI systems. In contrast to alternative frameworks that characterize image...

Bibliographic Details
Main Authors: Malihe Alikhani, Baber Khalid, Matthew Stone
Format: Article
Language: English
Published: Frontiers Media S.A. 2023-05-01
Series: Frontiers in Artificial Intelligence
Online Access: https://www.frontiersin.org/articles/10.3389/frai.2023.1048874/full