Flamingo: a visual language model for few-shot learning
Building models that can be rapidly adapted to novel tasks using only a handful of annotated examples is an open challenge for multimodal machine learning research. We introduce Flamingo, a family of Visual Language Models (VLM) with this ability. We propose key architectural innovations to: (i) bri...
Main Authors: | , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: | NeurIPS Proceedings, 2022 |