Human Movement Representation on Multivariate Time Series for Recognition of Professional Gestures and Forecasting Their Trajectories

Human-centered artificial intelligence is increasingly deployed in professional workplaces in Industry 4.0 to address various challenges related to the collaboration between the operators and the machines, the augmentation of their capabilities, or the improvement of the quality of their work and li...

Full description

Bibliographic Details
Main Authors: Sotiris Manitsaris, Gavriela Senteri, Dimitrios Makrygiannis, Alina Glushkova
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-08-01
Series:Frontiers in Robotics and AI
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/frobt.2020.00080/full
_version_ 1798022746641268736
author Sotiris Manitsaris
Gavriela Senteri
Dimitrios Makrygiannis
Alina Glushkova
author_facet Sotiris Manitsaris
Gavriela Senteri
Dimitrios Makrygiannis
Alina Glushkova
author_sort Sotiris Manitsaris
collection DOAJ
description Human-centered artificial intelligence is increasingly deployed in professional workplaces in Industry 4.0 to address various challenges related to the collaboration between the operators and the machines, the augmentation of their capabilities, or the improvement of the quality of their work and life in general. Intelligent systems and autonomous machines need to continuously recognize and follow the professional actions and gestures of the operators in order to collaborate with them and anticipate their trajectories for avoiding potential collisions and accidents. Nevertheless, the recognition of patterns of professional gestures is a very challenging task for both research and the industry. There are various types of human movements that the intelligent systems need to perceive, for example, gestural commands to machines and professional actions with or without the use of tools. Moreover, the interclass and intraclass spatiotemporal variances together with the very limited access to annotated human motion data constitute a major research challenge. In this paper, we introduce the gesture operational model, which describes how gestures are performed based on assumptions that focus on the dynamic association of body entities, their synergies, and their serial and non-serial mediations, as well as their transitioning over time from one state to another. Then, the assumptions of the gesture operational model are translated into a simultaneous equation system for each body entity through state-space modeling. The coefficients of the equation are computed using the maximum likelihood estimation method. The simulation of the model generates a confidence-bounding box for every entity that describes the tolerance of its spatial variance over time. The contribution of our approach is demonstrated for both recognizing gestures and forecasting human motion trajectories. In recognition, it is combined with continuous hidden Markov models to boost the recognition accuracy when the likelihoods are not confident. In forecasting, a motion trajectory can be estimated by taking as minimum input two observations only. The performance of the algorithm has been evaluated using four industrial datasets that contain gestures and actions from a TV assembly line: the glassblowing industry, the gestural commands to automated guided vehicles as well as the human–robot collaboration in the automotive assembly lines. The hybrid approach State-Space and HMMs outperforms standard continuous HMMs and a 3DCNN-based end-to-end deep architecture.
first_indexed 2024-04-11T17:35:02Z
format Article
id doaj.art-dd3b5705c6dd4a9388ab0634cb8a18f5
institution Directory Open Access Journal
issn 2296-9144
language English
last_indexed 2024-04-11T17:35:02Z
publishDate 2020-08-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Robotics and AI
spelling doaj.art-dd3b5705c6dd4a9388ab0634cb8a18f52022-12-22T04:11:44ZengFrontiers Media S.A.Frontiers in Robotics and AI2296-91442020-08-01710.3389/frobt.2020.00080518485Human Movement Representation on Multivariate Time Series for Recognition of Professional Gestures and Forecasting Their TrajectoriesSotiris ManitsarisGavriela SenteriDimitrios MakrygiannisAlina GlushkovaHuman-centered artificial intelligence is increasingly deployed in professional workplaces in Industry 4.0 to address various challenges related to the collaboration between the operators and the machines, the augmentation of their capabilities, or the improvement of the quality of their work and life in general. Intelligent systems and autonomous machines need to continuously recognize and follow the professional actions and gestures of the operators in order to collaborate with them and anticipate their trajectories for avoiding potential collisions and accidents. Nevertheless, the recognition of patterns of professional gestures is a very challenging task for both research and the industry. There are various types of human movements that the intelligent systems need to perceive, for example, gestural commands to machines and professional actions with or without the use of tools. Moreover, the interclass and intraclass spatiotemporal variances together with the very limited access to annotated human motion data constitute a major research challenge. In this paper, we introduce the gesture operational model, which describes how gestures are performed based on assumptions that focus on the dynamic association of body entities, their synergies, and their serial and non-serial mediations, as well as their transitioning over time from one state to another. Then, the assumptions of the gesture operational model are translated into a simultaneous equation system for each body entity through state-space modeling. The coefficients of the equation are computed using the maximum likelihood estimation method. The simulation of the model generates a confidence-bounding box for every entity that describes the tolerance of its spatial variance over time. The contribution of our approach is demonstrated for both recognizing gestures and forecasting human motion trajectories. In recognition, it is combined with continuous hidden Markov models to boost the recognition accuracy when the likelihoods are not confident. In forecasting, a motion trajectory can be estimated by taking as minimum input two observations only. The performance of the algorithm has been evaluated using four industrial datasets that contain gestures and actions from a TV assembly line: the glassblowing industry, the gestural commands to automated guided vehicles as well as the human–robot collaboration in the automotive assembly lines. The hybrid approach State-Space and HMMs outperforms standard continuous HMMs and a 3DCNN-based end-to-end deep architecture.https://www.frontiersin.org/article/10.3389/frobt.2020.00080/fullstate-space representationdifferential equationsmovement modelinghidden Markov modelsgesture recognitionforecasting
spellingShingle Sotiris Manitsaris
Gavriela Senteri
Dimitrios Makrygiannis
Alina Glushkova
Human Movement Representation on Multivariate Time Series for Recognition of Professional Gestures and Forecasting Their Trajectories
Frontiers in Robotics and AI
state-space representation
differential equations
movement modeling
hidden Markov models
gesture recognition
forecasting
title Human Movement Representation on Multivariate Time Series for Recognition of Professional Gestures and Forecasting Their Trajectories
title_full Human Movement Representation on Multivariate Time Series for Recognition of Professional Gestures and Forecasting Their Trajectories
title_fullStr Human Movement Representation on Multivariate Time Series for Recognition of Professional Gestures and Forecasting Their Trajectories
title_full_unstemmed Human Movement Representation on Multivariate Time Series for Recognition of Professional Gestures and Forecasting Their Trajectories
title_short Human Movement Representation on Multivariate Time Series for Recognition of Professional Gestures and Forecasting Their Trajectories
title_sort human movement representation on multivariate time series for recognition of professional gestures and forecasting their trajectories
topic state-space representation
differential equations
movement modeling
hidden Markov models
gesture recognition
forecasting
url https://www.frontiersin.org/article/10.3389/frobt.2020.00080/full
work_keys_str_mv AT sotirismanitsaris humanmovementrepresentationonmultivariatetimeseriesforrecognitionofprofessionalgesturesandforecastingtheirtrajectories
AT gavrielasenteri humanmovementrepresentationonmultivariatetimeseriesforrecognitionofprofessionalgesturesandforecastingtheirtrajectories
AT dimitriosmakrygiannis humanmovementrepresentationonmultivariatetimeseriesforrecognitionofprofessionalgesturesandforecastingtheirtrajectories
AT alinaglushkova humanmovementrepresentationonmultivariatetimeseriesforrecognitionofprofessionalgesturesandforecastingtheirtrajectories