Flamingo: a visual language model for few-shot learning

Building models that can be rapidly adapted to novel tasks using only a handful of annotated examples is an open challenge for multimodal machine learning research. We introduce Flamingo, a family of Visual Language Models (VLM) with this ability. We propose key architectural innovations to: (i) bri...

Full description

Bibliographic Details
Main Authors: Alayrac, J-B, Donahue, J, Luc, P, Miech, A, Barr, I, Hasson, Y, Lenc, K, Mensch, A, Millican, K, Reynolds, M, Ring, R, Rutherford, E, Cabi, S, Han, T, Gong, Z, Samangooei, S, Monteiro, M, Menick, J, Borgeaud, S, Brock, A, Nematzadeh, A, Sharifzadeh, S, Binkowski, M, Barreira, R, Vinyals, O, Zisserman, A, Simonyan, K
Format: Conference item
Language:English
Published: NeurIPS Proceedings 2022