SynthCLIP: are we ready for a fully synthetic CLIP training?
We present SynthCLIP, a novel framework for training CLIP models with entirely synthetic textimage pairs, significantly departing from previous methods relying on real data. Leveraging recent text-to-image (TTI) generative networks and large language models (LLM), we are able to generate synthetic d...
Main Authors: | , , , , , |
---|---|
פורמט: | Conference item |
שפה: | English |
יצא לאור: |
IEEE
2024
|