Human Action Recognition Based on 3D Convolution and Multi-Attention Transformer

To address the limitations of traditional two-stream networks, such as inadequate spatiotemporal information fusion, limited feature diversity, and insufficient accuracy, we propose an improved two-stream network for human action recognition based on multi-scale attention Transformer and 3D convolut...

Full description

Bibliographic Details
Main Authors: Minghua Liu, Wenjing Li, Bo He, Chuanxu Wang, Lianen Qu
Format: Article
Language:English
Published: MDPI AG 2025-03-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/5/2695