FLIPDIAL: A generative model for two-way visual dialogue
We present FLIPDIAL, a generative model for Visual Dialogue that simultaneously plays the role of both participants in a visually-grounded dialogue. Given context in the form of an image and an associated caption summarising the contents of the image, FLIPDIAL learns both to answer questions and put...
Auteurs principaux: | Massiceti, D, Narayanaswamy, S, Torr, P, Dokania, P |
---|---|
Format: | Conference item |
Publié: |
Institute of Electrical and Electronics Engineers
2018
|
Documents similaires
-
Visual dialogue without vision or dialogue
par: Massiceti, D, et autres
Publié: (2018) -
Bottom-up top-down cues for weakly-supervised semantic segmentation
par: Hou, Q, et autres
Publié: (2018) -
Multi-agent diverse generative adversarial networks
par: Ghosh, A, et autres
Publié: (2018) -
Random forests versus neural networks - What's best for camera localization?
par: Massiceti, D, et autres
Publié: (2017) -
A semi-supervised deep generative model for human body analysis
par: De Bem, R, et autres
Publié: (2019)