Video representation learning by dense predictive coding

The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, suitable for human action recognition. We make three contributions: First, we introduce the Dense Predictive Coding (DPC) framework for self-supervised representation learning on videos. This learns a de...

Bibliographic Details
Main Authors: Han, T; Xie, W; Zisserman, A
Format: Conference item
Language: English
Published: Computer Vision Foundation 2019