Few-shot action recognition with permutation-invariant attention
Many few-shot learning models focus on recognising images. In contrast, we tackle a challenging task of few-shot action recognition from videos. We build on a C3D encoder for spatio-temporal video blocks to capture short-range action patterns. Such encoded blocks are aggregated by permutation-invari...
Main Authors: | , , , , , |
---|---|
Format: | Conference item |
Language: | English |
Published: |
Springer
2020
|