Person Identification by Footstep Sound Using Convolutional Neural Networks

Human gait is very individual and it may serve as biometric to identify people in camera recordings. Comparable results can be achieved while using the acoustic signature of human footstep sounds. This acoustic solution offers the opportunity of less installation space and the use of cost-efficient...

Full description

Bibliographic Details
Main Authors: Stephan Algermissen, Max Hörnlein
Format: Article
Language:English
Published: MDPI AG 2021-05-01
Series:Applied Mechanics
Subjects:
Online Access:https://www.mdpi.com/2673-3161/2/2/16
_version_ 1797534543722315776
author Stephan Algermissen
Max Hörnlein
author_facet Stephan Algermissen
Max Hörnlein
author_sort Stephan Algermissen
collection DOAJ
description Human gait is very individual and it may serve as biometric to identify people in camera recordings. Comparable results can be achieved while using the acoustic signature of human footstep sounds. This acoustic solution offers the opportunity of less installation space and the use of cost-efficient microphones when compared to visual system. In this paper, a method for person identification based on footstep sounds is proposed. First, step sounds are isolated from microphone recordings and separated into 500 ms samples. The samples are transformed with a sliding window into mel-frequency cepstral coefficients (MFCC). The result is represented as an image that serves as input to a convolutional neural network (CNN). The dataset for training and validating the CNN is recorded with five subjects in the acoustic lab of DLR. These experiments identify a total number of 1125 steps. The validation of the CNN reveals a minimum <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>F</mi><mn>1</mn></msub></semantics></math></inline-formula>-score of 0.94 for all five classes and an accuracy of 0.98. The Grad-CAM method is applied to visualize the background of its decision in order to verify the functionality of the proposed CNN. Subsequently, two challenges for practical implementations, noise and different footwear, are discussed using experimental data.
first_indexed 2024-03-10T11:30:59Z
format Article
id doaj.art-ba4e4ee6656d4bb996bf830274f6a37d
institution Directory Open Access Journal
issn 2673-3161
language English
last_indexed 2024-03-10T11:30:59Z
publishDate 2021-05-01
publisher MDPI AG
record_format Article
series Applied Mechanics
spelling doaj.art-ba4e4ee6656d4bb996bf830274f6a37d2023-11-21T19:15:02ZengMDPI AGApplied Mechanics2673-31612021-05-012225727310.3390/applmech2020016Person Identification by Footstep Sound Using Convolutional Neural NetworksStephan Algermissen0Max Hörnlein1German Aerospace Center (DLR), Lilienthalplatz 7, 38108 Braunschweig, GermanyGerman Aerospace Center (DLR), Lilienthalplatz 7, 38108 Braunschweig, GermanyHuman gait is very individual and it may serve as biometric to identify people in camera recordings. Comparable results can be achieved while using the acoustic signature of human footstep sounds. This acoustic solution offers the opportunity of less installation space and the use of cost-efficient microphones when compared to visual system. In this paper, a method for person identification based on footstep sounds is proposed. First, step sounds are isolated from microphone recordings and separated into 500 ms samples. The samples are transformed with a sliding window into mel-frequency cepstral coefficients (MFCC). The result is represented as an image that serves as input to a convolutional neural network (CNN). The dataset for training and validating the CNN is recorded with five subjects in the acoustic lab of DLR. These experiments identify a total number of 1125 steps. The validation of the CNN reveals a minimum <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msub><mi>F</mi><mn>1</mn></msub></semantics></math></inline-formula>-score of 0.94 for all five classes and an accuracy of 0.98. The Grad-CAM method is applied to visualize the background of its decision in order to verify the functionality of the proposed CNN. Subsequently, two challenges for practical implementations, noise and different footwear, are discussed using experimental data.https://www.mdpi.com/2673-3161/2/2/16person identificationconvolutional neural networksMFCCgait recognitionmachine learning
spellingShingle Stephan Algermissen
Max Hörnlein
Person Identification by Footstep Sound Using Convolutional Neural Networks
Applied Mechanics
person identification
convolutional neural networks
MFCC
gait recognition
machine learning
title Person Identification by Footstep Sound Using Convolutional Neural Networks
title_full Person Identification by Footstep Sound Using Convolutional Neural Networks
title_fullStr Person Identification by Footstep Sound Using Convolutional Neural Networks
title_full_unstemmed Person Identification by Footstep Sound Using Convolutional Neural Networks
title_short Person Identification by Footstep Sound Using Convolutional Neural Networks
title_sort person identification by footstep sound using convolutional neural networks
topic person identification
convolutional neural networks
MFCC
gait recognition
machine learning
url https://www.mdpi.com/2673-3161/2/2/16
work_keys_str_mv AT stephanalgermissen personidentificationbyfootstepsoundusingconvolutionalneuralnetworks
AT maxhornlein personidentificationbyfootstepsoundusingconvolutionalneuralnetworks