FLIPDIAL: A generative model for two-way visual dialogue
We present FLIPDIAL, a generative model for Visual Dialogue that simultaneously plays the role of both participants in a visually-grounded dialogue. Given context in the form of an image and an associated caption summarising the contents of the image, FLIPDIAL learns both to answer questions and put...
Main Authors: | Massiceti, D, Narayanaswamy, S, Torr, P, Dokania, P |
---|---|
格式: | Conference item |
出版: |
Institute of Electrical and Electronics Engineers
2018
|
相似書籍
-
Visual dialogue without vision or dialogue
由: Massiceti, D, et al.
出版: (2018) -
Bottom-up top-down cues for weakly-supervised semantic segmentation
由: Hou, Q, et al.
出版: (2018) -
Multi-agent diverse generative adversarial networks
由: Ghosh, A, et al.
出版: (2018) -
Random forests versus neural networks - What's best for camera localization?
由: Massiceti, D, et al.
出版: (2017) -
A semi-supervised deep generative model for human body analysis
由: De Bem, R, et al.
出版: (2019)