FLIPDIAL: A generative model for two-way visual dialogue
We present FLIPDIAL, a generative model for Visual Dialogue that simultaneously plays the role of both participants in a visually-grounded dialogue. Given context in the form of an image and an associated caption summarising the contents of the image, FLIPDIAL learns both to answer questions and put...
主要な著者: | Massiceti, D, Narayanaswamy, S, Torr, P, Dokania, P |
---|---|
フォーマット: | Conference item |
出版事項: |
Institute of Electrical and Electronics Engineers
2018
|
類似資料
-
Visual dialogue without vision or dialogue
著者:: Massiceti, D, 等
出版事項: (2018) -
Bottom-up top-down cues for weakly-supervised semantic segmentation
著者:: Hou, Q, 等
出版事項: (2018) -
Multi-agent diverse generative adversarial networks
著者:: Ghosh, A, 等
出版事項: (2018) -
Random forests versus neural networks - What's best for camera localization?
著者:: Massiceti, D, 等
出版事項: (2017) -
A semi-supervised deep generative model for human body analysis
著者:: De Bem, R, 等
出版事項: (2019)