LGANet: Local and global attention are both you need for action recognition

Abstract Due to redundancy in the spatiotemporal neighborhood and the global dependency between video frames, video recognition remains a challenge. Some prior works have been mainly driven by 3D convolutional neural networks (CNNs) or 2D CNNs with a well‐designed module for temporal information. Ho...

Full description

Bibliographic Details
Main Authors: Hao Wang, Bin Zhao, Wenjia Zhang, Guohua Liu
Format: Article
Language:English
Published: Wiley 2023-10-01
Series:IET Image Processing
Subjects:
Online Access:https://doi.org/10.1049/ipr2.12876