Personalised CLIP or: how to find your vacation videos

In this paper, our goal is a person-centric model capable of retrieving the image or video corresponding to a personalized compound query from a large set of images or videos. Specifically, given a query consisting of an image of a person's \textit{face} and a text \textit{scene description} or...

Full description

Bibliographic Details
Main Authors: Korbar, B, Zisserman, A
Format: Conference item
Language:English
Published: British Machine Vision Association 2022