SynthCLIP: are we ready for a fully synthetic CLIP training?

We present SynthCLIP, a novel framework for training CLIP models with entirely synthetic text-image pairs, significantly departing from previous methods relying on real data. Leveraging recent text-to-image (TTI) generative networks and large language models (LLMs), we are able to generate synthetic d...

Bibliographic Details
Main Authors: Hammoud, HAAK, Itani, H, Pizzati, F, Torr, P, Bibi, A, Ghanem, B
Format: Conference item
Language: English
Published: IEEE 2024