FLIPDIAL: A generative model for two-way visual dialogue

We present FLIPDIAL, a generative model for Visual Dialogue that simultaneously plays the role of both participants in a visually-grounded dialogue. Given context in the form of an image and an associated caption summarising the contents of the image, FLIPDIAL learns both to answer questions and put...

Bibliographic details
Main authors: Massiceti, D; Narayanaswamy, S; Torr, P; Dokania, P
Format: Conference item
Published: Institute of Electrical and Electronics Engineers 2018