Use what you have: Video retrieval using representations from collaborative experts

The rapid growth of video on the internet has made searching for video content using natural language queries a significant challenge. Human-generated queries for video datasets ‘in the wild’ vary considerably in their degree of specificity, with some queries describing ‘specific details’ such as the na...

Bibliographic Details
Main Authors:	Liu, Y; Albanie, S; Nagrani, A; Zisserman, A
Format:	Conference item
Language:	English
Published:	British Machine Vision Association, 2020

Similar Items

Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
by: Albanie, S, et al.
Published: (2018)

Disentangled Speech Embeddings Using Cross-Modal Self-Supervision
by: Nagrani, A, et al.
Published: (2020)

Learnable PINs: Cross-modal embeddings for person identity
by: Nagrani, A, et al.
Published: (2018)

Seeing voices and hearing faces: Cross-modal biometric matching
by: Nagrani, A, et al.
Published: (2018)

TEACHTEXT: CrossModal generalized distillation for text-video retrieval
by: Croitoru, I, et al.
Published: (2022)

Frozen in time: A joint video and image encoder for end-to-end retrieval
by: Bain, M, et al.
Published: (2022)

A sound approach: using large language models to generate audio descriptions for egocentric text-audio retrieval
by: Oncescu, A-M, et al.
Published: (2024)

Condensed movies: story based retrieval with contextual embeddings
by: Bain, M, et al.
Published: (2021)

What have we learned from deep representations for action recognition?
by: Feichtenhofer, C, et al.
Published: (2018)

QUERYD: a video dataset with high-quality text and audio narrations
by: Oncescu, A-M, et al.
Published: (2021)

Video understanding using multimodal deep learning
by: Nagrani, A
Published: (2020)

Chimpanzee face recognition from videos in the wild using deep learning
by: Schofield, D, et al.
Published: (2019)

Video retrieval by mimicking poses
by: Jammalamadaka, N, et al.
Published: (2012)

Read and attend: temporal localisation in sign language videos
by: Varol, G, et al.
Published: (2021)

Efficient visual search of videos cast as text retrieval
by: Sivic, J, et al.
Published: (2009)

How having expertise does not make you the expert
by: Maimie Thompson, et al.
Published: (2015-05-01)

Aligning subtitles in sign language videos
by: Bull, H, et al.
Published: (2022)

Video representation learning by dense predictive coding
by: Han, T, et al.
Published: (2019)

Verbs in action: improving verb understanding in video-language models
by: Momeni, L, et al.
Published: (2024)

Is an object-centric video representation beneficial for transfer?
by: Zhang, C, et al.
Published: (2023)

Self-supervised co-training for video representation learning
by: Han, T, et al.
Published: (2020)

Automatic dense annotation of large-vocabulary sign language videos
by: Momeni, L, et al.
Published: (2022)

Weakly-supervised fingerspelling recognition in British Sign Language videos
by: Prajwal, KR, et al.
Published: (2022)

Sometimes you have to fight for what is right
by: Mohd Rasdi, Roziah
Published: (2013)

Memory-augmented dense predictive coding for video representation learning
by: Han, T, et al.
Published: (2020)

The visual centrifuge: Model-free layered video representations
by: Alayrac, J-B, et al.
Published: (2020)

Content based video retrieval based on HDWT and sparse representation
by: Sajad Mohamadzadeh, et al.
Published: (2016-04-01)

Will you have a yam? : a study in agency and representation
by: Steinfeld, Kyle Ross, 1975-
Published: (2005)

Introduction: “The Preface of What You Shall Have Been”
by: Philip Armstrong, et al.
Published: (2016-12-01)

From Benedict Cumberbatch to Sherlock Holmes: character identification in TV series without a script
by: Nagrani, A, et al.
Published: (2017)

AutoAD II: The Sequel - who, when, and what in movie audio description
by: Han, T, et al.
Published: (2024)

Smooth Object Retrieval using a Bag of Boundaries
by: Arandjelovic, R, et al.
Published: (2011)

The state of the art: object retrieval in paintings using discriminative regions
by: Crowley, E, et al.
Published: (2014)

How to get what you want without having to ask
by: Templar, Richard, 1950-2006
Published: (2011)

What you have to know about Human Milk Oligosaccharides
by: Vassilios Fanos, et al.
Published: (2018-04-01)

The Art of Recognizing What You Ought to Have Wanted to Look For
by: Andrew Abbott
Published: (2018-07-01)

“You Do What You Have To Do For The Babies”: The Pregnancy Experiences of Native American Women
by: Jessica Liddell, et al.
Published: (2023-10-01)

Audio retrieval with natural language queries
by: Oncescu, A-M, et al.
Published: (2021)

Space videos on YouTube - what makes the audience tick
by: Roos Maarten, et al.
Published: (2019-01-01)