MuPeG—The Multiple Person Gait Framework

Gait recognition is being employed as an effective approach to identify people without requiring subject collaboration. Nowadays, developed techniques for this task are obtaining high performance on current datasets (usually more than <inline-formula> <math display="inline"> &l...

Full description

Bibliographic Details
Main Authors: Rubén Delgado-Escaño, Francisco M. Castro, Julián R. Cózar, Manuel J. Marín-Jiménez, Nicolás Guil
Format: Article
Language:English
Published: MDPI AG 2020-03-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/20/5/1358
Description
Summary:Gait recognition is being employed as an effective approach to identify people without requiring subject collaboration. Nowadays, developed techniques for this task are obtaining high performance on current datasets (usually more than <inline-formula> <math display="inline"> <semantics> <mrow> <mn>90</mn> <mo>%</mo> </mrow> </semantics> </math> </inline-formula> of accuracy). However, those datasets are simple as they only contain one subject in the scene at the same time. This fact limits the extrapolation of the results to real world conditions where, usually, multiple subjects are simultaneously present at the scene, generating different types of occlusions and requiring better tracking methods and models trained to deal with those situations. Thus, with the aim of evaluating more realistic and challenging situations appearing in scenarios with multiple subjects, we release a new framework (MuPeG) that generates augmented datasets with multiple subjects using existing datasets as input. By this way, it is not necessary to record and label new videos, since it is automatically done by our framework. In addition, based on the use of datasets generated by our framework, we propose an experimental methodology that describes how to use datasets with multiple subjects and the recommended experiments that are necessary to perform. Moreover, we release the first experimental results using datasets with multiple subjects. In our case, we use an augmented version of TUM-GAID and CASIA-B datasets obtained with our framework. In these augmented datasets the obtained accuracies are <inline-formula> <math display="inline"> <semantics> <mrow> <mn>54.8</mn> <mo>%</mo> </mrow> </semantics> </math> </inline-formula> and <inline-formula> <math display="inline"> <semantics> <mrow> <mn>42.3</mn> <mo>%</mo> </mrow> </semantics> </math> </inline-formula> whereas in the original datasets (single subject), the same model achieved <inline-formula> <math display="inline"> <semantics> <mrow> <mn>99.7</mn> <mo>%</mo> </mrow> </semantics> </math> </inline-formula> and <inline-formula> <math display="inline"> <semantics> <mrow> <mn>98.0</mn> <mo>%</mo> </mrow> </semantics> </math> </inline-formula> for TUM-GAID and CASIA-B, respectively. The performance drop shows clearly that the difficulty of datasets with multiple subjects in the scene is much higher than the ones reported in the literature for a single subject. Thus, our proposed framework is able to generate useful datasets with multiple subjects which are more similar to real life situations.
ISSN:1424-8220