LGANet: Local and global attention are both you need for action recognition

Abstract:	Due to redundancy in the spatiotemporal neighborhood and the global dependency between video frames, video recognition remains a challenge. Some prior works have been mainly driven by 3D convolutional neural networks (CNNs) or 2D CNNs with a well‐designed module for temporal information. Ho...

Bibliographic Details
Main Authors:	Hao Wang, Bin Zhao, Wenjia Zhang, Guohua Liu
Format:	Article
Language:	English
Published:	Wiley 2023-10-01
Series:	IET Image Processing
Subjects:	Action recognition; Convolutional neural networks; Efficient Transformer; Video understanding
Online Access:	https://doi.org/10.1049/ipr2.12876
