Person Identification by Footstep Sound Using Convolutional Neural Networks

Human gait is very individual and it may serve as biometric to identify people in camera recordings. Comparable results can be achieved while using the acoustic signature of human footstep sounds. This acoustic solution offers the opportunity of less installation space and the use of cost-efficient...

Ausführliche Beschreibung

Bibliographische Detailangaben
Hauptverfasser:	Stephan Algermissen, Max Hörnlein
Format:	Artikel
Sprache:	English
Veröffentlicht:	MDPI AG 2021-05-01
Schriftenreihe:	Applied Mechanics
Schlagworte:	person identification convolutional neural networks MFCC gait recognition machine learning
Online Zugang:	https://www.mdpi.com/2673-3161/2/2/16

_version_	1827692873923952640
author	Stephan Algermissen Max Hörnlein
author_facet	Stephan Algermissen Max Hörnlein
author_sort	Stephan Algermissen
collection	DOAJ
description	Human gait is very individual and it may serve as biometric to identify people in camera recordings. Comparable results can be achieved while using the acoustic signature of human footstep sounds. This acoustic solution offers the opportunity of less installation space and the use of cost-efficient microphones when compared to visual system. In this paper, a method for person identification based on footstep sounds is proposed. First, step sounds are isolated from microphone recordings and separated into 500 ms samples. The samples are transformed with a sliding window into mel-frequency cepstral coefficients (MFCC). The result is represented as an image that serves as input to a convolutional neural network (CNN). The dataset for training and validating the CNN is recorded with five subjects in the acoustic lab of DLR. These experiments identify a total number of 1125 steps. The validation of the CNN reveals a minimum <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>F</mi><mn>1</mn></msub></semantics></math></inline-formula>-score of 0.94 for all five classes and an accuracy of 0.98. The Grad-CAM method is applied to visualize the background of its decision in order to verify the functionality of the proposed CNN. Subsequently, two challenges for practical implementations, noise and different footwear, are discussed using experimental data.
first_indexed	2024-03-10T11:30:59Z
format	Article
id	doaj.art-ba4e4ee6656d4bb996bf830274f6a37d
institution	Directory Open Access Journal
issn	2673-3161
language	English
last_indexed	2024-03-10T11:30:59Z
publishDate	2021-05-01
publisher	MDPI AG
record_format	Article
series	Applied Mechanics
spelling	doaj.art-ba4e4ee6656d4bb996bf830274f6a37d2023-11-21T19:15:02ZengMDPI AGApplied Mechanics2673-31612021-05-012225727310.3390/applmech2020016Person Identification by Footstep Sound Using Convolutional Neural NetworksStephan Algermissen0Max Hörnlein1German Aerospace Center (DLR), Lilienthalplatz 7, 38108 Braunschweig, GermanyGerman Aerospace Center (DLR), Lilienthalplatz 7, 38108 Braunschweig, GermanyHuman gait is very individual and it may serve as biometric to identify people in camera recordings. Comparable results can be achieved while using the acoustic signature of human footstep sounds. This acoustic solution offers the opportunity of less installation space and the use of cost-efficient microphones when compared to visual system. In this paper, a method for person identification based on footstep sounds is proposed. First, step sounds are isolated from microphone recordings and separated into 500 ms samples. The samples are transformed with a sliding window into mel-frequency cepstral coefficients (MFCC). The result is represented as an image that serves as input to a convolutional neural network (CNN). The dataset for training and validating the CNN is recorded with five subjects in the acoustic lab of DLR. These experiments identify a total number of 1125 steps. The validation of the CNN reveals a minimum <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>F</mi><mn>1</mn></msub></semantics></math></inline-formula>-score of 0.94 for all five classes and an accuracy of 0.98. The Grad-CAM method is applied to visualize the background of its decision in order to verify the functionality of the proposed CNN. Subsequently, two challenges for practical implementations, noise and different footwear, are discussed using experimental data.https://www.mdpi.com/2673-3161/2/2/16person identificationconvolutional neural networksMFCCgait recognitionmachine learning
spellingShingle	Stephan Algermissen Max Hörnlein Person Identification by Footstep Sound Using Convolutional Neural Networks Applied Mechanics person identification convolutional neural networks MFCC gait recognition machine learning
title	Person Identification by Footstep Sound Using Convolutional Neural Networks
title_full	Person Identification by Footstep Sound Using Convolutional Neural Networks
title_fullStr	Person Identification by Footstep Sound Using Convolutional Neural Networks
title_full_unstemmed	Person Identification by Footstep Sound Using Convolutional Neural Networks
title_short	Person Identification by Footstep Sound Using Convolutional Neural Networks
title_sort	person identification by footstep sound using convolutional neural networks
topic	person identification convolutional neural networks MFCC gait recognition machine learning
url	https://www.mdpi.com/2673-3161/2/2/16
work_keys_str_mv	AT stephanalgermissen personidentificationbyfootstepsoundusingconvolutionalneuralnetworks AT maxhornlein personidentificationbyfootstepsoundusingconvolutionalneuralnetworks

Person Identification by Footstep Sound Using Convolutional Neural Networks

Ähnliche Einträge