Video Action Recognition Using Motion and Multi-View Excitation with Temporal Aggregation

Spatiotemporal and motion feature representations are key to video action recognition. Previous approaches typically use 3D CNNs to handle both spatial and temporal features, but these are computationally expensive. Other approaches use (1+2)D CNNs to learn spatial and temp...

Bibliographic Details
Main Authors: Yuri Yudhaswana Joefrie, Masaki Aono
Format: Article
Language: English
Published: MDPI AG 2022-11-01
Series: Entropy
Online Access: https://www.mdpi.com/1099-4300/24/11/1663