Evaluation of Glottal Inverse Filtering Techniques on OPENGLOT Synthetic Male and Female Vowels

Current articulatory-based three-dimensional source–filter models, which allow the production of vowels and diphtongs, still present very limited expressiveness. Glottal inverse filtering (GIF) techniques can become instrumental to identify specific characteristics of both the glottal source signal...

Full description

Bibliographic Details
Main Authors: Marc Freixes, Luis Joglar-Ongay, Joan Claudi Socoró, Francesc Alías-Pujol
Format: Article
Language:English
Published: MDPI AG 2023-07-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/15/8775
_version_ 1797587098827489280
author Marc Freixes
Luis Joglar-Ongay
Joan Claudi Socoró
Francesc Alías-Pujol
author_facet Marc Freixes
Luis Joglar-Ongay
Joan Claudi Socoró
Francesc Alías-Pujol
author_sort Marc Freixes
collection DOAJ
description Current articulatory-based three-dimensional source–filter models, which allow the production of vowels and diphtongs, still present very limited expressiveness. Glottal inverse filtering (GIF) techniques can become instrumental to identify specific characteristics of both the glottal source signal and the vocal tract transfer function to resemble expressive speech. Several GIF methods have been proposed in the literature; however, their comparison becomes difficult due to the lack of common and exhaustive experimental settings. In this work, first, a two-phase analysis methodology for the comparison of GIF techniques based on a reference dataset is introduced. Next, state-of-the-art GIF techniques based on iterative adaptive inverse filtering (IAIF) and quasi closed phase (QCP) approaches are thoroughly evaluated on OPENGLOT, an open database specifically designed to evaluate GIF, computing well-established GIF error measures after extending male vowels with their female counterparts. The results show that GIF methods obtain better results on male vowels. The QCP-based techniques significantly outperform IAIF-based methods for almost all error metrics and scenarios and are, at the same time, more stable across sex, phonation type, F0, and vowels. The IAIF variants improve the original technique for most error metrics on male vowels, while QCP with spectral tilt compensation achieves a lower spectral tilt error for male vowels than the original QCP.
first_indexed 2024-03-11T00:31:29Z
format Article
id doaj.art-9403ebdb50734b02900cbd610a36257c
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T00:31:29Z
publishDate 2023-07-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-9403ebdb50734b02900cbd610a36257c2023-11-18T22:37:25ZengMDPI AGApplied Sciences2076-34172023-07-011315877510.3390/app13158775Evaluation of Glottal Inverse Filtering Techniques on OPENGLOT Synthetic Male and Female VowelsMarc Freixes0Luis Joglar-Ongay1Joan Claudi Socoró2Francesc Alías-Pujol3Human-Environment Research (HER), La Salle—Universitat Ramon Llull, Sant Joan de la Salle, 42, 08022 Barcelona, SpainHuman-Environment Research (HER), La Salle—Universitat Ramon Llull, Sant Joan de la Salle, 42, 08022 Barcelona, SpainHuman-Environment Research (HER), La Salle—Universitat Ramon Llull, Sant Joan de la Salle, 42, 08022 Barcelona, SpainHuman-Environment Research (HER), La Salle—Universitat Ramon Llull, Sant Joan de la Salle, 42, 08022 Barcelona, SpainCurrent articulatory-based three-dimensional source–filter models, which allow the production of vowels and diphtongs, still present very limited expressiveness. Glottal inverse filtering (GIF) techniques can become instrumental to identify specific characteristics of both the glottal source signal and the vocal tract transfer function to resemble expressive speech. Several GIF methods have been proposed in the literature; however, their comparison becomes difficult due to the lack of common and exhaustive experimental settings. In this work, first, a two-phase analysis methodology for the comparison of GIF techniques based on a reference dataset is introduced. Next, state-of-the-art GIF techniques based on iterative adaptive inverse filtering (IAIF) and quasi closed phase (QCP) approaches are thoroughly evaluated on OPENGLOT, an open database specifically designed to evaluate GIF, computing well-established GIF error measures after extending male vowels with their female counterparts. The results show that GIF methods obtain better results on male vowels. The QCP-based techniques significantly outperform IAIF-based methods for almost all error metrics and scenarios and are, at the same time, more stable across sex, phonation type, F0, and vowels. The IAIF variants improve the original technique for most error metrics on male vowels, while QCP with spectral tilt compensation achieves a lower spectral tilt error for male vowels than the original QCP.https://www.mdpi.com/2076-3417/13/15/8775performance evaluationglottal inverse filteringglottal sourcephonation typesspeech analysisOPENGLOT
spellingShingle Marc Freixes
Luis Joglar-Ongay
Joan Claudi Socoró
Francesc Alías-Pujol
Evaluation of Glottal Inverse Filtering Techniques on OPENGLOT Synthetic Male and Female Vowels
Applied Sciences
performance evaluation
glottal inverse filtering
glottal source
phonation types
speech analysis
OPENGLOT
title Evaluation of Glottal Inverse Filtering Techniques on OPENGLOT Synthetic Male and Female Vowels
title_full Evaluation of Glottal Inverse Filtering Techniques on OPENGLOT Synthetic Male and Female Vowels
title_fullStr Evaluation of Glottal Inverse Filtering Techniques on OPENGLOT Synthetic Male and Female Vowels
title_full_unstemmed Evaluation of Glottal Inverse Filtering Techniques on OPENGLOT Synthetic Male and Female Vowels
title_short Evaluation of Glottal Inverse Filtering Techniques on OPENGLOT Synthetic Male and Female Vowels
title_sort evaluation of glottal inverse filtering techniques on openglot synthetic male and female vowels
topic performance evaluation
glottal inverse filtering
glottal source
phonation types
speech analysis
OPENGLOT
url https://www.mdpi.com/2076-3417/13/15/8775
work_keys_str_mv AT marcfreixes evaluationofglottalinversefilteringtechniquesonopenglotsyntheticmaleandfemalevowels
AT luisjoglarongay evaluationofglottalinversefilteringtechniquesonopenglotsyntheticmaleandfemalevowels
AT joanclaudisocoro evaluationofglottalinversefilteringtechniquesonopenglotsyntheticmaleandfemalevowels
AT francescaliaspujol evaluationofglottalinversefilteringtechniquesonopenglotsyntheticmaleandfemalevowels