Machine Learning–based Analysis of English Lateral Allophones

Automatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and self-organizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditi...

Full description

Bibliographic Details
Main Authors: Piotrowska Magdalena, Korvel Gražina, Kostek Bożena, Ciszewski Tomasz, Cżyzewski Andrzej
Format: Article
Language:English
Published: Sciendo 2019-06-01
Series:International Journal of Applied Mathematics and Computer Science
Subjects:
Online Access:https://doi.org/10.2478/amcs-2019-0029
_version_ 1818587142766460928
author Piotrowska Magdalena
Korvel Gražina
Kostek Bożena
Ciszewski Tomasz
Cżyzewski Andrzej
author_facet Piotrowska Magdalena
Korvel Gražina
Kostek Bożena
Ciszewski Tomasz
Cżyzewski Andrzej
author_sort Piotrowska Magdalena
collection DOAJ
description Automatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and self-organizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’ and phonology experts’ speech was selected for analyses. For the purpose of the present study, a sub-list of 103 words containing the English alveolar lateral phoneme /l/ was compiled. The list includes ‘dark’ (velarized) allophonic realizations (which occur before a consonant or at the end of the word before silence) and 52 ‘clear’ allophonic realizations (which occur before a vowel), as well as voicing variants. The recorded signals were segmented into allophones and parametrized using a set of descriptors, originating from the MPEG 7 standard, plus dedicated time-based parameters as well as modified MFCC features proposed by the authors. Classification methods such as ANNs, the kNN and the SOM were employed to automatically detect the two types of allophones. Various sets of features were tested to achieve the best performance of the automatic methods. In the final experiment, a selected set of features was used for automatic evaluation of the pronunciation of dark /l/ by non-native speakers.
first_indexed 2024-12-16T09:04:10Z
format Article
id doaj.art-dd056168f88c4493970a53c5f3a7ab07
institution Directory Open Access Journal
issn 2083-8492
language English
last_indexed 2024-12-16T09:04:10Z
publishDate 2019-06-01
publisher Sciendo
record_format Article
series International Journal of Applied Mathematics and Computer Science
spelling doaj.art-dd056168f88c4493970a53c5f3a7ab072022-12-21T22:37:07ZengSciendoInternational Journal of Applied Mathematics and Computer Science2083-84922019-06-0129239340510.2478/amcs-2019-0029amcs-2019-0029Machine Learning–based Analysis of English Lateral AllophonesPiotrowska Magdalena0Korvel Gražina1Kostek Bożena2Ciszewski Tomasz3Cżyzewski Andrzej4Multimedia Systems Department, Gdansk University of Technology, G. Narutowicza 11/12, 80-233Gdansk, PolandInstitute of Data Science and Digital Technologies, Vilnius University, Akademijos 4, LT-04812Vilnius, LithuaniaLaboratory of Audio Acoustics, Gdansk University of Technology, G. Narutowicza 11/12, 80-233Gdansk, PolandMultimedia Systems Department, Gdansk University of Technology, G. Narutowicza 11/12, 80-233Gdansk, PolandMultimedia Systems Department, Gdansk University of Technology, G. Narutowicza 11/12, 80-233Gdansk, PolandAutomatic classification methods, such as artificial neural networks (ANNs), the k-nearest neighbor (kNN) and self-organizing maps (SOMs), are applied to allophone analysis based on recorded speech. A list of 650 words was created for that purpose, containing positionally and/or contextually conditioned allophones. For each word, a group of 16 native and non-native speakers were audio-video recorded, from which seven native speakers’ and phonology experts’ speech was selected for analyses. For the purpose of the present study, a sub-list of 103 words containing the English alveolar lateral phoneme /l/ was compiled. The list includes ‘dark’ (velarized) allophonic realizations (which occur before a consonant or at the end of the word before silence) and 52 ‘clear’ allophonic realizations (which occur before a vowel), as well as voicing variants. The recorded signals were segmented into allophones and parametrized using a set of descriptors, originating from the MPEG 7 standard, plus dedicated time-based parameters as well as modified MFCC features proposed by the authors. Classification methods such as ANNs, the kNN and the SOM were employed to automatically detect the two types of allophones. Various sets of features were tested to achieve the best performance of the automatic methods. In the final experiment, a selected set of features was used for automatic evaluation of the pronunciation of dark /l/ by non-native speakers.https://doi.org/10.2478/amcs-2019-0029allophonesaudio featuresartificial neural networks (anns)k-nearest neighbor (knn)self-organizing map (som)
spellingShingle Piotrowska Magdalena
Korvel Gražina
Kostek Bożena
Ciszewski Tomasz
Cżyzewski Andrzej
Machine Learning–based Analysis of English Lateral Allophones
International Journal of Applied Mathematics and Computer Science
allophones
audio features
artificial neural networks (anns)
k-nearest neighbor (knn)
self-organizing map (som)
title Machine Learning–based Analysis of English Lateral Allophones
title_full Machine Learning–based Analysis of English Lateral Allophones
title_fullStr Machine Learning–based Analysis of English Lateral Allophones
title_full_unstemmed Machine Learning–based Analysis of English Lateral Allophones
title_short Machine Learning–based Analysis of English Lateral Allophones
title_sort machine learning based analysis of english lateral allophones
topic allophones
audio features
artificial neural networks (anns)
k-nearest neighbor (knn)
self-organizing map (som)
url https://doi.org/10.2478/amcs-2019-0029
work_keys_str_mv AT piotrowskamagdalena machinelearningbasedanalysisofenglishlateralallophones
AT korvelgrazina machinelearningbasedanalysisofenglishlateralallophones
AT kostekbozena machinelearningbasedanalysisofenglishlateralallophones
AT ciszewskitomasz machinelearningbasedanalysisofenglishlateralallophones
AT czyzewskiandrzej machinelearningbasedanalysisofenglishlateralallophones