Relating by contrasting: A data-efficient framework for multimodal DGMs

Multimodal learning for generative models often refers to the learning of abstract concepts from the commonality of information in multiple modalities, such as vision and language. While it has proven effective for learning generalisable representations, the training of such models often requires a...

Full description

Bibliographic Details
Main Authors:	Shi, Y, Paige, B, Torr, PHS, Siddharth, N
Format:	Conference item
Language:	English
Published:	Open Review 2021

Relating by contrasting: A data-efficient framework for multimodal DGMs

Similar Items