Multi‐cue combination network for action‐based video classification
Action‐based video classification (or video‐based action recognition) is an active research area in computer vision. However, all currently utilised action‐based video classification approaches take spatial and temporal components into consideration while acoustic features (e.g. sound and speech) ar...
Main Authors: | Yan Tian, Yifan Cao, Jiachen Wu, Wei Hu, Chao Song, Tao Yang |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2019-09-01
|
Series: | IET Computer Vision |
Subjects: | |
Online Access: | https://doi.org/10.1049/iet-cvi.2018.5492 |
Similar Items
-
Mutual modality learning for video action classification
by: S.A. Komkov, et al.
Published: (2023-08-01) -
Meta‐action descriptor for action recognition in RGBD video
by: Min Huang, et al.
Published: (2017-06-01) -
A Multi-Scale Video Longformer Network for Action Recognition
by: Congping Chen, et al.
Published: (2024-01-01) -
Efficient Transformer-Based Compressed Video Modeling via Informative Patch Selection
by: Tomoyuki Suzuki, et al.
Published: (2022-12-01) -
Skeletal Keypoint-Based Transformer Model for Human Action Recognition in Aerial Videos
by: Shahab Uddin, et al.
Published: (2024-01-01)