Video representation learning by dense predictive coding
The objective of this paper is self-supervised learning of spatio-temporal embeddings from video, suitable for human action recognition. We make three contributions: First, we introduce the Dense Predictive Coding (DPC) framework for selfsupervised representation learning on videos. This learns a de...
Hoofdauteurs: | , , |
---|---|
Formaat: | Conference item |
Taal: | English |
Gepubliceerd in: |
Computer Vision Foundation
2019
|