Vowel classification based on frequency response of vocal tract

Automatic speech recognition (ASR) has made great strides with the development of digital signal processing hardware and software especially using English as the language of choice.In this paper, a modified feature extraction approach based on frequency response model of the vocal tract and Bark sca...

Full description

Bibliographic Details
Main Authors: M.P., Paulraj, Yaacob, Sazali, Mohd Yusof, S. A.
Format: Conference or Workshop Item
Language:English
Published: 2008
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/7427/1/04580782.pdf
Description
Summary:Automatic speech recognition (ASR) has made great strides with the development of digital signal processing hardware and software especially using English as the language of choice.In this paper, a modified feature extraction approach based on frequency response model of the vocal tract and Bark scale using vowel utterances from Malaysian speakers is presented.This technique calculates mean and maximum energy values from fixed frequency bands between 20 Hz to 2500 Hz. The frequency band sizes are 100 Hz, 200 Hz, 300 Hz, 400 Hz and 500 Hz. These results are then compared with mean and maximum values of first 14 critical bands of the Bark scale.The energy features obtained are classified using multinomial logistic regression and used to detect five vowels of /a/, /e/, /i/, /o/ and /u/ recorded from 80 Malaysian speakers. The classification results obtained from the 100 Hz and 200 Hz bands gave better result than the Bark scale.Vowel /a/, /e/ and /i/ obtained a perfect 100% detection rate for both 100 Hz and 200 Hz bands.Vowel /o/ and /u/ did not fare as good but still obtained greater than 90% classification rate.