Video representation learning by dense predictive coding
The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, suitable for human action recognition. We make three contributions: First, we introduce the Dense Predictive Coding (DPC) framework for selfsupervised representation learning on videos. This learns a de...
Main Authors: | , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
Computer Vision Foundation
2019
|