Labelling unlabelled videos from scratch with multi-modal self-supervision

A large part of the current success of deep learning lies in the effectiveness of data -- more precisely: of labeled data. Yet, labelling a dataset with human annotation continues to carry high costs, especially for videos. While in the image domain, recent methods have allowed to generate meaningfu...

詳細記述

書誌詳細
主要な著者:	Asano, YM, Patrick, M, Rupprecht, C, Vedaldi, A
フォーマット:	Conference item
言語:	English
出版事項:	NeurIPS 2020

Labelling unlabelled videos from scratch with multi-modal self-supervision

類似資料