Electroglottography based real-time voice-to-MIDI controller

Voice-to-MIDI real-time conversion is a challenging problem that comes with a series of obstacles and complications. The main issue is the tracking of the human voice pitch. Extracting the voice fundamental frequency can be inaccurate and highly computationally exacting due to the spectral complexit...

Full description

Bibliographic Details
Main Authors: Eugenio Donati, Christos Chousidis
Format: Article
Language:English
Published: Elsevier 2022-06-01
Series:Neuroscience Informatics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2772528622000036
_version_ 1817980605281861632
author Eugenio Donati
Christos Chousidis
author_facet Eugenio Donati
Christos Chousidis
author_sort Eugenio Donati
collection DOAJ
description Voice-to-MIDI real-time conversion is a challenging problem that comes with a series of obstacles and complications. The main issue is the tracking of the human voice pitch. Extracting the voice fundamental frequency can be inaccurate and highly computationally exacting due to the spectral complexity of voice signals. In addition, on account of microphone usage, the presence of environmental noise can further affect voice processing. An analysis of the current research and status of the market shows a plethora of voice-to-MIDI implementations revolving around the processing of audio signals deriving from microphones. This paper addresses the above-mentioned issues by implementing a novel experimental method where electroglottography is employed instead of microphones as a source for pitch-tracking. In the proposed system, the signal is processed and converted through an embedded hardware device. The use of electroglottography improves both the accuracy of pitch evaluation and the ease of voice information processing; firstly, it provides a direct measurement of the vocal folds' activity and, secondly, it bypasses the interferences caused by external sound sources. This allows the extraction of a simpler and cleaner signal that yields a more effective evaluation of the fundamental frequency during phonation. The proposed method delivers a faster and less computationally demanding conversion thus in turn, allowing for an efficacious real-time voice-to-MIDI conversion.
first_indexed 2024-04-13T22:55:34Z
format Article
id doaj.art-200129e3d043480b811a8e8598048345
institution Directory Open Access Journal
issn 2772-5286
language English
last_indexed 2024-04-13T22:55:34Z
publishDate 2022-06-01
publisher Elsevier
record_format Article
series Neuroscience Informatics
spelling doaj.art-200129e3d043480b811a8e85980483452022-12-22T02:26:01ZengElsevierNeuroscience Informatics2772-52862022-06-0122100041Electroglottography based real-time voice-to-MIDI controllerEugenio Donati0Christos Chousidis1Corresponding author.; School of Computing and Engineering, University of West London, St. Mary's road, W55RF, London, UKSchool of Computing and Engineering, University of West London, St. Mary's road, W55RF, London, UKVoice-to-MIDI real-time conversion is a challenging problem that comes with a series of obstacles and complications. The main issue is the tracking of the human voice pitch. Extracting the voice fundamental frequency can be inaccurate and highly computationally exacting due to the spectral complexity of voice signals. In addition, on account of microphone usage, the presence of environmental noise can further affect voice processing. An analysis of the current research and status of the market shows a plethora of voice-to-MIDI implementations revolving around the processing of audio signals deriving from microphones. This paper addresses the above-mentioned issues by implementing a novel experimental method where electroglottography is employed instead of microphones as a source for pitch-tracking. In the proposed system, the signal is processed and converted through an embedded hardware device. The use of electroglottography improves both the accuracy of pitch evaluation and the ease of voice information processing; firstly, it provides a direct measurement of the vocal folds' activity and, secondly, it bypasses the interferences caused by external sound sources. This allows the extraction of a simpler and cleaner signal that yields a more effective evaluation of the fundamental frequency during phonation. The proposed method delivers a faster and less computationally demanding conversion thus in turn, allowing for an efficacious real-time voice-to-MIDI conversion.http://www.sciencedirect.com/science/article/pii/S2772528622000036ElectroglottographyBioimpedance measurementsEGG-to-MIDIVoice-to-MIDIVoice information retrievalReal-time audio conversion
spellingShingle Eugenio Donati
Christos Chousidis
Electroglottography based real-time voice-to-MIDI controller
Neuroscience Informatics
Electroglottography
Bioimpedance measurements
EGG-to-MIDI
Voice-to-MIDI
Voice information retrieval
Real-time audio conversion
title Electroglottography based real-time voice-to-MIDI controller
title_full Electroglottography based real-time voice-to-MIDI controller
title_fullStr Electroglottography based real-time voice-to-MIDI controller
title_full_unstemmed Electroglottography based real-time voice-to-MIDI controller
title_short Electroglottography based real-time voice-to-MIDI controller
title_sort electroglottography based real time voice to midi controller
topic Electroglottography
Bioimpedance measurements
EGG-to-MIDI
Voice-to-MIDI
Voice information retrieval
Real-time audio conversion
url http://www.sciencedirect.com/science/article/pii/S2772528622000036
work_keys_str_mv AT eugeniodonati electroglottographybasedrealtimevoicetomidicontroller
AT christoschousidis electroglottographybasedrealtimevoicetomidicontroller