Electroglottography based real-time voice-to-MIDI controller
Voice-to-MIDI real-time conversion is a challenging problem that comes with a series of obstacles and complications. The main issue is the tracking of the human voice pitch. Extracting the voice fundamental frequency can be inaccurate and highly computationally exacting due to the spectral complexit...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2022-06-01
|
Series: | Neuroscience Informatics |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2772528622000036 |
_version_ | 1817980605281861632 |
---|---|
author | Eugenio Donati Christos Chousidis |
author_facet | Eugenio Donati Christos Chousidis |
author_sort | Eugenio Donati |
collection | DOAJ |
description | Voice-to-MIDI real-time conversion is a challenging problem that comes with a series of obstacles and complications. The main issue is the tracking of the human voice pitch. Extracting the voice fundamental frequency can be inaccurate and highly computationally exacting due to the spectral complexity of voice signals. In addition, on account of microphone usage, the presence of environmental noise can further affect voice processing. An analysis of the current research and status of the market shows a plethora of voice-to-MIDI implementations revolving around the processing of audio signals deriving from microphones. This paper addresses the above-mentioned issues by implementing a novel experimental method where electroglottography is employed instead of microphones as a source for pitch-tracking. In the proposed system, the signal is processed and converted through an embedded hardware device. The use of electroglottography improves both the accuracy of pitch evaluation and the ease of voice information processing; firstly, it provides a direct measurement of the vocal folds' activity and, secondly, it bypasses the interferences caused by external sound sources. This allows the extraction of a simpler and cleaner signal that yields a more effective evaluation of the fundamental frequency during phonation. The proposed method delivers a faster and less computationally demanding conversion thus in turn, allowing for an efficacious real-time voice-to-MIDI conversion. |
first_indexed | 2024-04-13T22:55:34Z |
format | Article |
id | doaj.art-200129e3d043480b811a8e8598048345 |
institution | Directory Open Access Journal |
issn | 2772-5286 |
language | English |
last_indexed | 2024-04-13T22:55:34Z |
publishDate | 2022-06-01 |
publisher | Elsevier |
record_format | Article |
series | Neuroscience Informatics |
spelling | doaj.art-200129e3d043480b811a8e85980483452022-12-22T02:26:01ZengElsevierNeuroscience Informatics2772-52862022-06-0122100041Electroglottography based real-time voice-to-MIDI controllerEugenio Donati0Christos Chousidis1Corresponding author.; School of Computing and Engineering, University of West London, St. Mary's road, W55RF, London, UKSchool of Computing and Engineering, University of West London, St. Mary's road, W55RF, London, UKVoice-to-MIDI real-time conversion is a challenging problem that comes with a series of obstacles and complications. The main issue is the tracking of the human voice pitch. Extracting the voice fundamental frequency can be inaccurate and highly computationally exacting due to the spectral complexity of voice signals. In addition, on account of microphone usage, the presence of environmental noise can further affect voice processing. An analysis of the current research and status of the market shows a plethora of voice-to-MIDI implementations revolving around the processing of audio signals deriving from microphones. This paper addresses the above-mentioned issues by implementing a novel experimental method where electroglottography is employed instead of microphones as a source for pitch-tracking. In the proposed system, the signal is processed and converted through an embedded hardware device. The use of electroglottography improves both the accuracy of pitch evaluation and the ease of voice information processing; firstly, it provides a direct measurement of the vocal folds' activity and, secondly, it bypasses the interferences caused by external sound sources. This allows the extraction of a simpler and cleaner signal that yields a more effective evaluation of the fundamental frequency during phonation. The proposed method delivers a faster and less computationally demanding conversion thus in turn, allowing for an efficacious real-time voice-to-MIDI conversion.http://www.sciencedirect.com/science/article/pii/S2772528622000036ElectroglottographyBioimpedance measurementsEGG-to-MIDIVoice-to-MIDIVoice information retrievalReal-time audio conversion |
spellingShingle | Eugenio Donati Christos Chousidis Electroglottography based real-time voice-to-MIDI controller Neuroscience Informatics Electroglottography Bioimpedance measurements EGG-to-MIDI Voice-to-MIDI Voice information retrieval Real-time audio conversion |
title | Electroglottography based real-time voice-to-MIDI controller |
title_full | Electroglottography based real-time voice-to-MIDI controller |
title_fullStr | Electroglottography based real-time voice-to-MIDI controller |
title_full_unstemmed | Electroglottography based real-time voice-to-MIDI controller |
title_short | Electroglottography based real-time voice-to-MIDI controller |
title_sort | electroglottography based real time voice to midi controller |
topic | Electroglottography Bioimpedance measurements EGG-to-MIDI Voice-to-MIDI Voice information retrieval Real-time audio conversion |
url | http://www.sciencedirect.com/science/article/pii/S2772528622000036 |
work_keys_str_mv | AT eugeniodonati electroglottographybasedrealtimevoicetomidicontroller AT christoschousidis electroglottographybasedrealtimevoicetomidicontroller |