Controllable attention for structured layered video decomposition
The objective of this paper is to be able to separate a video into its natural layers, and to control which of the separated layers to attend to. For example, to be able to separate reflections, transparency or object motion. We make the following three contributions: (i) we introduce a new structur...
Main Authors: | Alayrac, J-B, Carreira, J, Arandjelovic, R, Zisserman, A |
---|---|
Format: | Conference item |
Language: | English |
Published: |
IEEE
2020
|
Similar Items
-
The visual centrifuge: Model-free layered video representations
by: Alayrac, J-B, et al.
Published: (2020) -
Visual grounding in video for unsupervised word translation
by: Sigurdsson, GA, et al.
Published: (2020) -
Video action transformer network
by: Girdhar, R, et al.
Published: (2020) -
Massively parallel video networks
by: Carreira, J, et al.
Published: (2018) -
End-to-end learning of visual representations from uncurated instructional videos
by: Miech, A, et al.
Published: (2020)