Cross-Channel Graph Convolutional Networks for Skeleton-Based Action Recognition

In recent years, skeleton-based action recognition, graph convolutional networks, have achieved remarkable performance. In these existing works, the features of all nodes in the neighbor set are aggregated into the updated features of the root node, while these features are located in the same featu...

Full description

Bibliographic Details
Main Authors:	Jun Xie, Wentian Xin, Ruyi Liu, Lijie Sheng, Xiangzeng Liu, Xuesong Gao, Sheng Zhong, Lei Tang, Qiguang Miao
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Skeleton-based action recognition graph convolutional network channel attention action recognition
Online Access:	https://ieeexplore.ieee.org/document/9316772/

_version_	1819175906154905600
author	Jun Xie Wentian Xin Ruyi Liu Lijie Sheng Xiangzeng Liu Xuesong Gao Sheng Zhong Lei Tang Qiguang Miao
author_facet	Jun Xie Wentian Xin Ruyi Liu Lijie Sheng Xiangzeng Liu Xuesong Gao Sheng Zhong Lei Tang Qiguang Miao
author_sort	Jun Xie
collection	DOAJ
description	In recent years, skeleton-based action recognition, graph convolutional networks, have achieved remarkable performance. In these existing works, the features of all nodes in the neighbor set are aggregated into the updated features of the root node, while these features are located in the same feature channel determined by the same 1 × 1 convolution filter. This may not be optimal for capturing the features of spatial dimensions among adjacent vertices effectively. Besides, the effect of feature channels that are independent of the current action on the performance of the model is rarely investigated in existing methods. In this paper, we propose cross-channel graph convolutional networks for skeleton-based action recognition. The features fusion mechanism in our network is cross-channel, i.e, the updated feature of the root node is derived from different feature channels. Because different feature channels come from different 1 × 1 convolution filters, the cross-channel fusion mechanism significantly improves the ability of the model to capture local features among adjacent vertices. Moreover, by introducing a channel attention mechanism to our model, we suppress the influence of feature channels unrelated to action recognition on model performance, which improves the robustness of the model against the feature channels independent of the current action. Extensive experiments on the two large-scale datasets, NTU-RGB+D and KineticsSkeleton, demonstrate that the performance of our model exceeds the current mainstream methods.
first_indexed	2024-12-22T21:02:18Z
format	Article
id	doaj.art-e69a6429498f4b65a985254565a5f841
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-12-22T21:02:18Z
publishDate	2021-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-e69a6429498f4b65a985254565a5f8412022-12-21T18:12:47ZengIEEEIEEE Access2169-35362021-01-0199055906510.1109/ACCESS.2021.30498089316772Cross-Channel Graph Convolutional Networks for Skeleton-Based Action RecognitionJun Xie0https://orcid.org/0000-0002-4359-1045Wentian Xin1https://orcid.org/0000-0002-6479-3092Ruyi Liu2https://orcid.org/0000-0001-9231-1081Lijie Sheng3Xiangzeng Liu4https://orcid.org/0000-0002-2751-6096Xuesong Gao5Sheng Zhong6Lei Tang7Qiguang Miao8https://orcid.org/0000-0001-6766-8310School of Computer Science and Technology, Xidian University, Xi’an, ChinaSchool of Computer Science and Technology, Xidian University, Xi’an, ChinaSchool of Computer Science and Technology, Xidian University, Xi’an, ChinaSchool of Computer Science and Technology, Xidian University, Xi’an, ChinaSchool of Computer Science and Technology, Xidian University, Xi’an, ChinaState Key Laboratory of Digital Multimedia Technology, Hisense Company Ltd., Qingdao, ChinaSchool of Information Science and Technology, Northwest University of China, Xi’an, ChinaXi’an Microelectronics Technology Institute, Xi’an, ChinaSchool of Computer Science and Technology, Xidian University, Xi’an, ChinaIn recent years, skeleton-based action recognition, graph convolutional networks, have achieved remarkable performance. In these existing works, the features of all nodes in the neighbor set are aggregated into the updated features of the root node, while these features are located in the same feature channel determined by the same 1 × 1 convolution filter. This may not be optimal for capturing the features of spatial dimensions among adjacent vertices effectively. Besides, the effect of feature channels that are independent of the current action on the performance of the model is rarely investigated in existing methods. In this paper, we propose cross-channel graph convolutional networks for skeleton-based action recognition. The features fusion mechanism in our network is cross-channel, i.e, the updated feature of the root node is derived from different feature channels. Because different feature channels come from different 1 × 1 convolution filters, the cross-channel fusion mechanism significantly improves the ability of the model to capture local features among adjacent vertices. Moreover, by introducing a channel attention mechanism to our model, we suppress the influence of feature channels unrelated to action recognition on model performance, which improves the robustness of the model against the feature channels independent of the current action. Extensive experiments on the two large-scale datasets, NTU-RGB+D and KineticsSkeleton, demonstrate that the performance of our model exceeds the current mainstream methods.https://ieeexplore.ieee.org/document/9316772/Skeleton-based action recognitiongraph convolutional networkchannel attentionaction recognition
spellingShingle	Jun Xie Wentian Xin Ruyi Liu Lijie Sheng Xiangzeng Liu Xuesong Gao Sheng Zhong Lei Tang Qiguang Miao Cross-Channel Graph Convolutional Networks for Skeleton-Based Action Recognition IEEE Access Skeleton-based action recognition graph convolutional network channel attention action recognition
title	Cross-Channel Graph Convolutional Networks for Skeleton-Based Action Recognition
title_full	Cross-Channel Graph Convolutional Networks for Skeleton-Based Action Recognition
title_fullStr	Cross-Channel Graph Convolutional Networks for Skeleton-Based Action Recognition
title_full_unstemmed	Cross-Channel Graph Convolutional Networks for Skeleton-Based Action Recognition
title_short	Cross-Channel Graph Convolutional Networks for Skeleton-Based Action Recognition
title_sort	cross channel graph convolutional networks for skeleton based action recognition
topic	Skeleton-based action recognition graph convolutional network channel attention action recognition
url	https://ieeexplore.ieee.org/document/9316772/
work_keys_str_mv	AT junxie crosschannelgraphconvolutionalnetworksforskeletonbasedactionrecognition AT wentianxin crosschannelgraphconvolutionalnetworksforskeletonbasedactionrecognition AT ruyiliu crosschannelgraphconvolutionalnetworksforskeletonbasedactionrecognition AT lijiesheng crosschannelgraphconvolutionalnetworksforskeletonbasedactionrecognition AT xiangzengliu crosschannelgraphconvolutionalnetworksforskeletonbasedactionrecognition AT xuesonggao crosschannelgraphconvolutionalnetworksforskeletonbasedactionrecognition AT shengzhong crosschannelgraphconvolutionalnetworksforskeletonbasedactionrecognition AT leitang crosschannelgraphconvolutionalnetworksforskeletonbasedactionrecognition AT qiguangmiao crosschannelgraphconvolutionalnetworksforskeletonbasedactionrecognition

Cross-Channel Graph Convolutional Networks for Skeleton-Based Action Recognition

Similar Items