SynthCLIP: are we ready for a fully synthetic CLIP training?

We present SynthCLIP, a novel framework for training CLIP models with entirely synthetic text-image pairs, significantly departing from previous methods relying on real data. Leveraging recent text-to-image (TTI) generative networks and large language models (LLMs), we are able to generate synthetic d...

Bibliographic Details
Main Authors: Hammoud, HAAK, Itani, H, Pizzati, F, Torr, P, Bibi, A, Ghanem, B
Format: Conference item
Language: English
Published: IEEE 2024