Comparison of a Deep Learning-Based Pose Estimation System to Marker-Based and Kinect Systems in Exergaming for Balance Training

Using standard digital cameras in combination with deep learning (DL) for pose estimation is promising for the in-home and independent use of exercise games (exergames). We need to investigate to what extent such DL-based systems can provide satisfying accuracy on exergame relevant measures. Our stu...

Full description

Bibliographic Details
Main Authors:	Elise Klæbo Vonstad, Xiaomeng Su, Beatrix Vereijken, Kerstin Bach, Jan Harald Nilsen
Format:	Article
Language:	English
Published:	MDPI AG 2020-12-01
Series:	Sensors
Subjects:	motion capture image analysis markerless motion capture exergaming segment lengths kinect
Online Access:	https://www.mdpi.com/1424-8220/20/23/6940

_version_	1797545782190014464
author	Elise Klæbo Vonstad Xiaomeng Su Beatrix Vereijken Kerstin Bach Jan Harald Nilsen
author_facet	Elise Klæbo Vonstad Xiaomeng Su Beatrix Vereijken Kerstin Bach Jan Harald Nilsen
author_sort	Elise Klæbo Vonstad
collection	DOAJ
description	Using standard digital cameras in combination with deep learning (DL) for pose estimation is promising for the in-home and independent use of exercise games (exergames). We need to investigate to what extent such DL-based systems can provide satisfying accuracy on exergame relevant measures. Our study assesses temporal variation (i.e., variability) in body segment lengths, while using a Deep Learning image processing tool (DeepLabCut, DLC) on two-dimensional (2D) video. This variability is then compared with a gold-standard, marker-based three-dimensional Motion Capturing system (3DMoCap, Qualisys AB), and a 3D RGB-depth camera system (Kinect V2, Microsoft Inc). Simultaneous data were collected from all three systems, while participants (N = 12) played a custom balance training exergame. The pose estimation DLC-model is pre-trained on a large-scale dataset (ImageNet) and optimized with context-specific pose annotated images. Wilcoxon’s signed-rank test was performed in order to assess the statistical significance of the differences in variability between systems. The results showed that the DLC method performs comparably to the Kinect and, in some segments, even to the 3DMoCap gold standard system with regard to variability. These results are promising for making exergames more accessible and easier to use, thereby increasing their availability for in-home exercise.
first_indexed	2024-03-10T14:19:53Z
format	Article
id	doaj.art-e6f3ea23cc2342af9a71c10be3d5e17b
institution	Directory Open Access Journal
issn	1424-8220
language	English
last_indexed	2024-03-10T14:19:53Z
publishDate	2020-12-01
publisher	MDPI AG
record_format	Article
series	Sensors
spelling	doaj.art-e6f3ea23cc2342af9a71c10be3d5e17b2023-11-20T23:30:40ZengMDPI AGSensors1424-82202020-12-012023694010.3390/s20236940Comparison of a Deep Learning-Based Pose Estimation System to Marker-Based and Kinect Systems in Exergaming for Balance TrainingElise Klæbo Vonstad0Xiaomeng Su1Beatrix Vereijken2Kerstin Bach3Jan Harald Nilsen4Department of Computer Science, Norwegian University of Science and Technology, 7034 Trondheim, NorwayDepartment of Computer Science, Norwegian University of Science and Technology, 7034 Trondheim, NorwayDepartment of Neuromedicine and Movement Science, Norwegian University of Science and Technology, 7030 Trondheim, NorwayDepartment of Computer Science, Norwegian University of Science and Technology, 7034 Trondheim, NorwayDepartment of Computer Science, Norwegian University of Science and Technology, 7034 Trondheim, NorwayUsing standard digital cameras in combination with deep learning (DL) for pose estimation is promising for the in-home and independent use of exercise games (exergames). We need to investigate to what extent such DL-based systems can provide satisfying accuracy on exergame relevant measures. Our study assesses temporal variation (i.e., variability) in body segment lengths, while using a Deep Learning image processing tool (DeepLabCut, DLC) on two-dimensional (2D) video. This variability is then compared with a gold-standard, marker-based three-dimensional Motion Capturing system (3DMoCap, Qualisys AB), and a 3D RGB-depth camera system (Kinect V2, Microsoft Inc). Simultaneous data were collected from all three systems, while participants (N = 12) played a custom balance training exergame. The pose estimation DLC-model is pre-trained on a large-scale dataset (ImageNet) and optimized with context-specific pose annotated images. Wilcoxon’s signed-rank test was performed in order to assess the statistical significance of the differences in variability between systems. The results showed that the DLC method performs comparably to the Kinect and, in some segments, even to the 3DMoCap gold standard system with regard to variability. These results are promising for making exergames more accessible and easier to use, thereby increasing their availability for in-home exercise.https://www.mdpi.com/1424-8220/20/23/6940motion captureimage analysismarkerless motion captureexergamingsegment lengthskinect
spellingShingle	Elise Klæbo Vonstad Xiaomeng Su Beatrix Vereijken Kerstin Bach Jan Harald Nilsen Comparison of a Deep Learning-Based Pose Estimation System to Marker-Based and Kinect Systems in Exergaming for Balance Training Sensors motion capture image analysis markerless motion capture exergaming segment lengths kinect
title	Comparison of a Deep Learning-Based Pose Estimation System to Marker-Based and Kinect Systems in Exergaming for Balance Training
title_full	Comparison of a Deep Learning-Based Pose Estimation System to Marker-Based and Kinect Systems in Exergaming for Balance Training
title_fullStr	Comparison of a Deep Learning-Based Pose Estimation System to Marker-Based and Kinect Systems in Exergaming for Balance Training
title_full_unstemmed	Comparison of a Deep Learning-Based Pose Estimation System to Marker-Based and Kinect Systems in Exergaming for Balance Training
title_short	Comparison of a Deep Learning-Based Pose Estimation System to Marker-Based and Kinect Systems in Exergaming for Balance Training
title_sort	comparison of a deep learning based pose estimation system to marker based and kinect systems in exergaming for balance training
topic	motion capture image analysis markerless motion capture exergaming segment lengths kinect
url	https://www.mdpi.com/1424-8220/20/23/6940
work_keys_str_mv	AT eliseklæbovonstad comparisonofadeeplearningbasedposeestimationsystemtomarkerbasedandkinectsystemsinexergamingforbalancetraining AT xiaomengsu comparisonofadeeplearningbasedposeestimationsystemtomarkerbasedandkinectsystemsinexergamingforbalancetraining AT beatrixvereijken comparisonofadeeplearningbasedposeestimationsystemtomarkerbasedandkinectsystemsinexergamingforbalancetraining AT kerstinbach comparisonofadeeplearningbasedposeestimationsystemtomarkerbasedandkinectsystemsinexergamingforbalancetraining AT janharaldnilsen comparisonofadeeplearningbasedposeestimationsystemtomarkerbasedandkinectsystemsinexergamingforbalancetraining

Comparison of a Deep Learning-Based Pose Estimation System to Marker-Based and Kinect Systems in Exergaming for Balance Training

Similar Items