Few-shot action recognition with permutation-invariant attention

Many few-shot learning models focus on recognising images. In contrast, we tackle a challenging task of few-shot action recognition from videos. We build on a C3D encoder for spatio-temporal video blocks to capture short-range action patterns. Such encoded blocks are aggregated by permutation-invari...

Full description

Bibliographic Details
Main Authors: Zhang, H, Zhang, L, Qi, X, Li, H, Torr, PHS, Koniusz, P
Format: Conference item
Language:English
Published: Springer 2020