Frozen in time: A joint video and image encoder for end-to-end retrieval

Our objective in this work is video-text retrieval – in particular a joint embedding that enables efficient text-to-video retrieval. The challenges in this area include the design of the visual architecture and the nature of the training data, in that the available large scale video-text training da...

全面介紹

書目詳細資料
Main Authors:	Bain, M, Nagrani, A, Varol, G, Zisserman, A
格式:	Conference item
語言:	English
出版:	IEEE 2022

Frozen in time: A joint video and image encoder for end-to-end retrieval

相似書籍