Video representation learning by dense predictive coding

The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, suitable for human action recognition. We make three contributions: First, we introduce the Dense Predictive Coding (DPC) framework for selfsupervised representation learning on videos. This learns a de...

Volledige beschrijving

Bibliografische gegevens
Hoofdauteurs:	Han, T, Xie, W, Zisserman, A
Formaat:	Conference item
Taal:	English
Gepubliceerd in:	Computer Vision Foundation 2019

Video representation learning by dense predictive coding

Gelijkaardige items