Sign Language Gesture Recognition and Classification Based on Event Camera with Spiking Neural Networks

Sign language recognition has been utilized in human–machine interactions, improving the lives of people with speech impairments or who rely on nonverbal instructions. Thanks to its higher temporal resolution, less visual redundancy information and lower energy consumption, the use of an event camer...

Full description

Bibliographic Details
Main Authors: Xuena Chen, Li Su, Jinxiu Zhao, Keni Qiu, Na Jiang, Guang Zhai
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/12/4/786
_version_ 1797621339506343936
author Xuena Chen
Li Su
Jinxiu Zhao
Keni Qiu
Na Jiang
Guang Zhai
author_facet Xuena Chen
Li Su
Jinxiu Zhao
Keni Qiu
Na Jiang
Guang Zhai
author_sort Xuena Chen
collection DOAJ
description Sign language recognition has been utilized in human–machine interactions, improving the lives of people with speech impairments or who rely on nonverbal instructions. Thanks to its higher temporal resolution, less visual redundancy information and lower energy consumption, the use of an event camera with a new dynamic vision sensor (DVS) shows promise with regard to sign language recognition with robot perception and intelligent control. Although previous work has focused on event camera-based, simple gesture datasets, such as DVS128Gesture, event camera gesture datasets inspired by sign language are critical, which poses a great impediment to the development of event camera-based sign language recognition. An effective method to extract spatio-temporal features from event data is significantly desired. Firstly, the event-based sign language gesture datasets are proposed and the data have two sources: traditional sign language videos to event stream (DVS_Sign_v2e) and DAVIS346 (DVS_Sign). In the present dataset, data are divided into five classification, verbs, quantifiers, position, things and people, adapting to actual scenarios where robots provide instruction or assistance. Sign language classification is demonstrated in spike neuron networks with a spatio-temporal back-propagation training method, leading to the best recognition accuracy of 77%. This work paves the way for the combination of event camera-based sign language gesture recognition and robotic perception for the future intelligent systems.
first_indexed 2024-03-11T08:54:34Z
format Article
id doaj.art-075c36a02f07445aa3ce753346711ef6
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-11T08:54:34Z
publishDate 2023-02-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-075c36a02f07445aa3ce753346711ef62023-11-16T20:10:01ZengMDPI AGElectronics2079-92922023-02-0112478610.3390/electronics12040786Sign Language Gesture Recognition and Classification Based on Event Camera with Spiking Neural NetworksXuena Chen0Li Su1Jinxiu Zhao2Keni Qiu3Na Jiang4Guang Zhai5School of Information Engineering, Capital Normal University, Beijing 100048, ChinaSchool of Information Engineering, Capital Normal University, Beijing 100048, ChinaSchool of Information Engineering, Capital Normal University, Beijing 100048, ChinaSchool of Information Engineering, Capital Normal University, Beijing 100048, ChinaSchool of Information Engineering, Capital Normal University, Beijing 100048, ChinaSchool of Aerospace Engineering, Beijing Institute of Technology, Beijing 100081, ChinaSign language recognition has been utilized in human–machine interactions, improving the lives of people with speech impairments or who rely on nonverbal instructions. Thanks to its higher temporal resolution, less visual redundancy information and lower energy consumption, the use of an event camera with a new dynamic vision sensor (DVS) shows promise with regard to sign language recognition with robot perception and intelligent control. Although previous work has focused on event camera-based, simple gesture datasets, such as DVS128Gesture, event camera gesture datasets inspired by sign language are critical, which poses a great impediment to the development of event camera-based sign language recognition. An effective method to extract spatio-temporal features from event data is significantly desired. Firstly, the event-based sign language gesture datasets are proposed and the data have two sources: traditional sign language videos to event stream (DVS_Sign_v2e) and DAVIS346 (DVS_Sign). In the present dataset, data are divided into five classification, verbs, quantifiers, position, things and people, adapting to actual scenarios where robots provide instruction or assistance. Sign language classification is demonstrated in spike neuron networks with a spatio-temporal back-propagation training method, leading to the best recognition accuracy of 77%. This work paves the way for the combination of event camera-based sign language gesture recognition and robotic perception for the future intelligent systems.https://www.mdpi.com/2079-9292/12/4/786event cameraspiking neural networkDVS-sign languagesign language recognitionintelligent system
spellingShingle Xuena Chen
Li Su
Jinxiu Zhao
Keni Qiu
Na Jiang
Guang Zhai
Sign Language Gesture Recognition and Classification Based on Event Camera with Spiking Neural Networks
Electronics
event camera
spiking neural network
DVS-sign language
sign language recognition
intelligent system
title Sign Language Gesture Recognition and Classification Based on Event Camera with Spiking Neural Networks
title_full Sign Language Gesture Recognition and Classification Based on Event Camera with Spiking Neural Networks
title_fullStr Sign Language Gesture Recognition and Classification Based on Event Camera with Spiking Neural Networks
title_full_unstemmed Sign Language Gesture Recognition and Classification Based on Event Camera with Spiking Neural Networks
title_short Sign Language Gesture Recognition and Classification Based on Event Camera with Spiking Neural Networks
title_sort sign language gesture recognition and classification based on event camera with spiking neural networks
topic event camera
spiking neural network
DVS-sign language
sign language recognition
intelligent system
url https://www.mdpi.com/2079-9292/12/4/786
work_keys_str_mv AT xuenachen signlanguagegesturerecognitionandclassificationbasedoneventcamerawithspikingneuralnetworks
AT lisu signlanguagegesturerecognitionandclassificationbasedoneventcamerawithspikingneuralnetworks
AT jinxiuzhao signlanguagegesturerecognitionandclassificationbasedoneventcamerawithspikingneuralnetworks
AT keniqiu signlanguagegesturerecognitionandclassificationbasedoneventcamerawithspikingneuralnetworks
AT najiang signlanguagegesturerecognitionandclassificationbasedoneventcamerawithspikingneuralnetworks
AT guangzhai signlanguagegesturerecognitionandclassificationbasedoneventcamerawithspikingneuralnetworks