Light-Weight Classification of Human Actions in Video with Skeleton-Based Features

An approach to human action classification in videos is presented, based on knowledge-aware initial features extracted from human skeleton data and on further processing by convolutional networks. The proposed smart tracking of skeleton joints, approximation of missing joints and normalization of sk...

Full description

Bibliographic Details
Main Authors:	Włodzimierz Kasprzak, Bartłomiej Jankowski
Format:	Article
Language:	English
Published:	MDPI AG 2022-07-01
Series:	Electronics
Subjects:	human actions neural network models skeleton data machine learning NTU-RGBD dataset OpenPose
Online Access:	https://www.mdpi.com/2079-9292/11/14/2145

_version_	1797433701800345600
author	Włodzimierz Kasprzak Bartłomiej Jankowski
author_facet	Włodzimierz Kasprzak Bartłomiej Jankowski
author_sort	Włodzimierz Kasprzak
collection	DOAJ
description	An approach to human action classification in videos is presented, based on knowledge-aware initial features extracted from human skeleton data and on further processing by convolutional networks. The proposed smart tracking of skeleton joints, approximation of missing joints and normalization of skeleton data are important steps of feature extraction. Three neural network models—based on LSTM, Transformer and CNN—are developed and experimentally verified. The models are trained and tested on the well-known NTU-RGB+D (Shahroudy et al., 2016) dataset in the cross-view mode. The obtained results show a competitive performance with other SOTA methods and verify the efficiency of proposed feature engineering. The network has a five times lower number of trainable parameters than other proposed methods to reach nearly similar performance and twenty times lower number than the currently best performing solutions. Thanks to the lightness of the classifier, the solution only requires relatively small computational resources.
first_indexed	2024-03-09T10:20:45Z
format	Article
id	doaj.art-df48ae6483424f92a6886df0bd4814eb
institution	Directory Open Access Journal
issn	2079-9292
language	English
last_indexed	2024-03-09T10:20:45Z
publishDate	2022-07-01
publisher	MDPI AG
record_format	Article
series	Electronics
spelling	doaj.art-df48ae6483424f92a6886df0bd4814eb2023-12-01T22:05:11ZengMDPI AGElectronics2079-92922022-07-011114214510.3390/electronics11142145Light-Weight Classification of Human Actions in Video with Skeleton-Based FeaturesWłodzimierz Kasprzak0Bartłomiej Jankowski1Institute of Control and Computation Engineering, Warsaw University of Technology, 00-661 Warszawa, PolandInstitute of Control and Computation Engineering, Warsaw University of Technology, 00-661 Warszawa, PolandAn approach to human action classification in videos is presented, based on knowledge-aware initial features extracted from human skeleton data and on further processing by convolutional networks. The proposed smart tracking of skeleton joints, approximation of missing joints and normalization of skeleton data are important steps of feature extraction. Three neural network models—based on LSTM, Transformer and CNN—are developed and experimentally verified. The models are trained and tested on the well-known NTU-RGB+D (Shahroudy et al., 2016) dataset in the cross-view mode. The obtained results show a competitive performance with other SOTA methods and verify the efficiency of proposed feature engineering. The network has a five times lower number of trainable parameters than other proposed methods to reach nearly similar performance and twenty times lower number than the currently best performing solutions. Thanks to the lightness of the classifier, the solution only requires relatively small computational resources.https://www.mdpi.com/2079-9292/11/14/2145human actionsneural network modelsskeleton datamachine learningNTU-RGBD datasetOpenPose
spellingShingle	Włodzimierz Kasprzak Bartłomiej Jankowski Light-Weight Classification of Human Actions in Video with Skeleton-Based Features Electronics human actions neural network models skeleton data machine learning NTU-RGBD dataset OpenPose
title	Light-Weight Classification of Human Actions in Video with Skeleton-Based Features
title_full	Light-Weight Classification of Human Actions in Video with Skeleton-Based Features
title_fullStr	Light-Weight Classification of Human Actions in Video with Skeleton-Based Features
title_full_unstemmed	Light-Weight Classification of Human Actions in Video with Skeleton-Based Features
title_short	Light-Weight Classification of Human Actions in Video with Skeleton-Based Features
title_sort	light weight classification of human actions in video with skeleton based features
topic	human actions neural network models skeleton data machine learning NTU-RGBD dataset OpenPose
url	https://www.mdpi.com/2079-9292/11/14/2145
work_keys_str_mv	AT włodzimierzkasprzak lightweightclassificationofhumanactionsinvideowithskeletonbasedfeatures AT bartłomiejjankowski lightweightclassificationofhumanactionsinvideowithskeletonbasedfeatures

Light-Weight Classification of Human Actions in Video with Skeleton-Based Features

Similar Items