Personalised CLIP or: how to find your vacation videos
In this paper, our goal is a person-centric model capable of retrieving the image or video corresponding to a personalized compound query from a large set of images or videos. Specifically, given a query consisting of an image of a person's \textit{face} and a text \textit{scene description} or...
Main Authors: | , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
British Machine Vision Association
2022
|