Relating by contrasting: A data-efficient framework for multimodal DGMs
Multimodal learning for generative models often refers to the learning of abstract concepts from the commonality of information in multiple modalities, such as vision and language. While it has proven effective for learning generalisable representations, the training of such models often requires a...
Main Authors: | , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: | Open Review 2021 |