Infusing a Convolutional Neural Network with Encoded Joint Node Image Data to Recognize 25 Daily Human Activities

Human activity recognition (HAR) has gained popularity in the field of computer vision such as video surveillance, security, and virtual reality. However, traditional methods are limited in terms of computations and holistic learning of human skeletal sequences. In this article, a new time‐series sk...

Full description

Bibliographic Details
Main Authors: Yuliang Zhao, Tianang Sun, Zhongjie Ju, Fanghecong Dong, Le Yang, Xiaoyong Lv, Chao Lian, Meng Chen, Wen Jung Li
Format: Article
Language:English
Published: Wiley 2023-11-01
Series:Advanced Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1002/aisy.202300266
_version_ 1827632237210763264
author Yuliang Zhao
Tianang Sun
Zhongjie Ju
Fanghecong Dong
Le Yang
Xiaoyong Lv
Chao Lian
Meng Chen
Wen Jung Li
author_facet Yuliang Zhao
Tianang Sun
Zhongjie Ju
Fanghecong Dong
Le Yang
Xiaoyong Lv
Chao Lian
Meng Chen
Wen Jung Li
author_sort Yuliang Zhao
collection DOAJ
description Human activity recognition (HAR) has gained popularity in the field of computer vision such as video surveillance, security, and virtual reality. However, traditional methods are limited in terms of computations and holistic learning of human skeletal sequences. In this article, a new time‐series skeleton joint data imaging method is infused into an improved convolutional neural network to handle these problems. First, the raw time‐series data of 33 body nodes are transformed to red–green–blue images by encoding the 3D positional information to one pixel. Second, the LeNet‐5 network is enhanced by expanding the receptive field, introducing coordinate attention and the smooth maximum unit to improve smoothness and feature extraction. Third, the ability of coded images to express human activities is studied in various environments. It is shown in the experimental results that the method achieves an impressive accuracy of 98.02% in recognizing 25 daily human activities, such as running, writing, and walking. In addition, it is shown that the number of floating point operations, parameters, and inference time of the method are 0.08%, 0.47%, and 3.05%, respectively, of the average values for six other networks (including AlexNet, GoogLeNet, and MobileNet). The proposed method is thus a novel, lightweight, and high‐precision solution for HAR.
first_indexed 2024-03-09T14:33:15Z
format Article
id doaj.art-f8e0188cb0c144a29bc68065a152df7a
institution Directory Open Access Journal
issn 2640-4567
language English
last_indexed 2024-03-09T14:33:15Z
publishDate 2023-11-01
publisher Wiley
record_format Article
series Advanced Intelligent Systems
spelling doaj.art-f8e0188cb0c144a29bc68065a152df7a2023-11-27T21:14:08ZengWileyAdvanced Intelligent Systems2640-45672023-11-01511n/an/a10.1002/aisy.202300266Infusing a Convolutional Neural Network with Encoded Joint Node Image Data to Recognize 25 Daily Human ActivitiesYuliang Zhao0Tianang Sun1Zhongjie Ju2Fanghecong Dong3Le Yang4Xiaoyong Lv5Chao Lian6Meng Chen7Wen Jung Li8School of Information Science and Engineering Northeastern University Shenyang 110819 ChinaSchool of Information Science and Engineering Northeastern University Shenyang 110819 ChinaSchool of Information Science and Engineering Northeastern University Shenyang 110819 ChinaSchool of Information Science and Engineering Northeastern University Shenyang 110819 ChinaSchool of Information Science and Engineering Northeastern University Shenyang 110819 ChinaSchool of Information Science and Engineering Northeastern University Shenyang 110819 ChinaSchool of Information Science and Engineering Northeastern University Shenyang 110819 ChinaDepartment of Mechanical Engineering City University of Hong Kong Hong Kong SAR 999077 ChinaDepartment of Mechanical Engineering City University of Hong Kong Hong Kong SAR 999077 ChinaHuman activity recognition (HAR) has gained popularity in the field of computer vision such as video surveillance, security, and virtual reality. However, traditional methods are limited in terms of computations and holistic learning of human skeletal sequences. In this article, a new time‐series skeleton joint data imaging method is infused into an improved convolutional neural network to handle these problems. First, the raw time‐series data of 33 body nodes are transformed to red–green–blue images by encoding the 3D positional information to one pixel. Second, the LeNet‐5 network is enhanced by expanding the receptive field, introducing coordinate attention and the smooth maximum unit to improve smoothness and feature extraction. Third, the ability of coded images to express human activities is studied in various environments. It is shown in the experimental results that the method achieves an impressive accuracy of 98.02% in recognizing 25 daily human activities, such as running, writing, and walking. In addition, it is shown that the number of floating point operations, parameters, and inference time of the method are 0.08%, 0.47%, and 3.05%, respectively, of the average values for six other networks (including AlexNet, GoogLeNet, and MobileNet). The proposed method is thus a novel, lightweight, and high‐precision solution for HAR.https://doi.org/10.1002/aisy.202300266convolutional neural networkhuman activity recognitionimage processing
spellingShingle Yuliang Zhao
Tianang Sun
Zhongjie Ju
Fanghecong Dong
Le Yang
Xiaoyong Lv
Chao Lian
Meng Chen
Wen Jung Li
Infusing a Convolutional Neural Network with Encoded Joint Node Image Data to Recognize 25 Daily Human Activities
Advanced Intelligent Systems
convolutional neural network
human activity recognition
image processing
title Infusing a Convolutional Neural Network with Encoded Joint Node Image Data to Recognize 25 Daily Human Activities
title_full Infusing a Convolutional Neural Network with Encoded Joint Node Image Data to Recognize 25 Daily Human Activities
title_fullStr Infusing a Convolutional Neural Network with Encoded Joint Node Image Data to Recognize 25 Daily Human Activities
title_full_unstemmed Infusing a Convolutional Neural Network with Encoded Joint Node Image Data to Recognize 25 Daily Human Activities
title_short Infusing a Convolutional Neural Network with Encoded Joint Node Image Data to Recognize 25 Daily Human Activities
title_sort infusing a convolutional neural network with encoded joint node image data to recognize 25 daily human activities
topic convolutional neural network
human activity recognition
image processing
url https://doi.org/10.1002/aisy.202300266
work_keys_str_mv AT yuliangzhao infusingaconvolutionalneuralnetworkwithencodedjointnodeimagedatatorecognize25dailyhumanactivities
AT tianangsun infusingaconvolutionalneuralnetworkwithencodedjointnodeimagedatatorecognize25dailyhumanactivities
AT zhongjieju infusingaconvolutionalneuralnetworkwithencodedjointnodeimagedatatorecognize25dailyhumanactivities
AT fanghecongdong infusingaconvolutionalneuralnetworkwithencodedjointnodeimagedatatorecognize25dailyhumanactivities
AT leyang infusingaconvolutionalneuralnetworkwithencodedjointnodeimagedatatorecognize25dailyhumanactivities
AT xiaoyonglv infusingaconvolutionalneuralnetworkwithencodedjointnodeimagedatatorecognize25dailyhumanactivities
AT chaolian infusingaconvolutionalneuralnetworkwithencodedjointnodeimagedatatorecognize25dailyhumanactivities
AT mengchen infusingaconvolutionalneuralnetworkwithencodedjointnodeimagedatatorecognize25dailyhumanactivities
AT wenjungli infusingaconvolutionalneuralnetworkwithencodedjointnodeimagedatatorecognize25dailyhumanactivities