FLIPDIAL: A generative model for two-way visual dialogue
We present FLIPDIAL, a generative model for Visual Dialogue that simultaneously plays the role of both participants in a visually-grounded dialogue. Given context in the form of an image and an associated caption summarising the contents of the image, FLIPDIAL learns both to answer questions and put...
المؤلفون الرئيسيون: | Massiceti, D, Narayanaswamy, S, Torr, P, Dokania, P |
---|---|
التنسيق: | Conference item |
منشور في: |
Institute of Electrical and Electronics Engineers
2018
|
مواد مشابهة
-
Visual dialogue without vision or dialogue
حسب: Massiceti, D, وآخرون
منشور في: (2018) -
Bottom-up top-down cues for weakly-supervised semantic segmentation
حسب: Hou, Q, وآخرون
منشور في: (2018) -
Multi-agent diverse generative adversarial networks
حسب: Ghosh, A, وآخرون
منشور في: (2018) -
Random forests versus neural networks - What's best for camera localization?
حسب: Massiceti, D, وآخرون
منشور في: (2017) -
A semi-supervised deep generative model for human body analysis
حسب: De Bem, R, وآخرون
منشور في: (2019)