As firm as their foundations: creating transferable adversarial examples across downstream tasks with CLIP

Foundation models pre-trained on web-scale vision-language data, such as CLIP, are widely used as cornerstones of powerful machine learning systems. While pre-training offers clear advantages for downstream learning, it also endows downstream models with shared adversarial vulnerabilities that can b...

Cijeli opis

Bibliografski detalji
Glavni autori: Hu, A, Gu, J, Pinto, F, Kamnitsas, K, Torr, PHS
Format: Conference item
Jezik:English
Izdano: British Machine Vision Association 2024