Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional Network

Compared with the traditional RGB-based methods, the skeleton-based action recognition methods have become the main research direction in the field of computer vision in recent years because they are less affected by many factors such as illumination, viewing angle and background complexity. However...

Full description

Bibliographic Details
Main Author: SU Jiangyi, SONG Xiaoning, WU Xiaojun, YU Dongjun
Format: Article
Language:zho
Published: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press 2021-04-01
Series:Jisuanji kexue yu tansuo
Subjects:
Online Access:http://fcst.ceaj.org/CN/abstract/abstract2658.shtml
_version_ 1828916841091694592
author SU Jiangyi, SONG Xiaoning, WU Xiaojun, YU Dongjun
author_facet SU Jiangyi, SONG Xiaoning, WU Xiaojun, YU Dongjun
author_sort SU Jiangyi, SONG Xiaoning, WU Xiaojun, YU Dongjun
collection DOAJ
description Compared with the traditional RGB-based methods, the skeleton-based action recognition methods have become the main research direction in the field of computer vision in recent years because they are less affected by many factors such as illumination, viewing angle and background complexity. However, the current skeleton-based methods still have some problems such as large parameters, long time-consuming and high computational complexity, which makes it complicated and difficult to meet the requirements of efficiency and accuracy simultaneously. To address these issues, a lightweight graph convolution network using multi-modal data fusion is proposed. Firstly, the multi-modal information flow data are fused by multi-modal data fusion method. Secondly, the spatial and temporal information of human joints are extracted using spatial and temporal modules respectively. Finally, the classification results are obtained through the fully connected layer. Experimental results conducted on the two commonly used datasets including NTU60 RGB+D and NTU120 RGB+D demonstrate that the proposed network outperforms some mainstream methods in the last two years in both recognition accuracy and efficiency, thus verifying that the network has excellent performance in terms of accuracy, while considering time efficiency and computational cost.
first_indexed 2024-12-13T20:40:53Z
format Article
id doaj.art-17f6bac1026b4ac0b9d0d72221daad66
institution Directory Open Access Journal
issn 1673-9418
language zho
last_indexed 2024-12-13T20:40:53Z
publishDate 2021-04-01
publisher Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press
record_format Article
series Jisuanji kexue yu tansuo
spelling doaj.art-17f6bac1026b4ac0b9d0d72221daad662022-12-21T23:32:09ZzhoJournal of Computer Engineering and Applications Beijing Co., Ltd., Science PressJisuanji kexue yu tansuo1673-94182021-04-0115473374210.3778/j.issn.1673-9418.2008051Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional NetworkSU Jiangyi, SONG Xiaoning, WU Xiaojun, YU Dongjun01. School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, Jiangsu 214122, China 2. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, ChinaCompared with the traditional RGB-based methods, the skeleton-based action recognition methods have become the main research direction in the field of computer vision in recent years because they are less affected by many factors such as illumination, viewing angle and background complexity. However, the current skeleton-based methods still have some problems such as large parameters, long time-consuming and high computational complexity, which makes it complicated and difficult to meet the requirements of efficiency and accuracy simultaneously. To address these issues, a lightweight graph convolution network using multi-modal data fusion is proposed. Firstly, the multi-modal information flow data are fused by multi-modal data fusion method. Secondly, the spatial and temporal information of human joints are extracted using spatial and temporal modules respectively. Finally, the classification results are obtained through the fully connected layer. Experimental results conducted on the two commonly used datasets including NTU60 RGB+D and NTU120 RGB+D demonstrate that the proposed network outperforms some mainstream methods in the last two years in both recognition accuracy and efficiency, thus verifying that the network has excellent performance in terms of accuracy, while considering time efficiency and computational cost.http://fcst.ceaj.org/CN/abstract/abstract2658.shtmlaction recognitionhuman skeletonlightweightgraph convolutional network
spellingShingle SU Jiangyi, SONG Xiaoning, WU Xiaojun, YU Dongjun
Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional Network
Jisuanji kexue yu tansuo
action recognition
human skeleton
lightweight
graph convolutional network
title Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional Network
title_full Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional Network
title_fullStr Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional Network
title_full_unstemmed Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional Network
title_short Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional Network
title_sort skeleton based action recognition algorithm on multi modal lightweight graph convolutional network
topic action recognition
human skeleton
lightweight
graph convolutional network
url http://fcst.ceaj.org/CN/abstract/abstract2658.shtml
work_keys_str_mv AT sujiangyisongxiaoningwuxiaojunyudongjun skeletonbasedactionrecognitionalgorithmonmultimodallightweightgraphconvolutionalnetwork