Automatic phonetic segmentation of malay speech database

This paper deals with automatic phonetic segmentation for Malay continuous speech. This study investigates fast and automatic phone segmentation in preparing database for Malay concatenative Text-to-Speech (TTS) systems. A 35 Malay phone set has been chosen, which is suitable for building Malay TTS....

Full description

Bibliographic Details
Main Authors: Ting, Chee Ming, Shaikh Salleh, Sheikh Hussain, Tan, Tian Swee, Ariff, Ahmad Kamarul
Format: Conference or Workshop Item
Language:English
Published: 2007
Subjects:
Online Access:http://eprints.utm.my/7635/1/Sheikh_Hussain_Shaikh_2007_Automatic_Phonetic_Segmentation_of_Malay_Speech.pdf
_version_ 1825910069396504576
author Ting, Chee Ming
Shaikh Salleh, Sheikh Hussain
Tan, Tian Swee
Ariff, Ahmad Kamarul
author_facet Ting, Chee Ming
Shaikh Salleh, Sheikh Hussain
Tan, Tian Swee
Ariff, Ahmad Kamarul
author_sort Ting, Chee Ming
collection ePrints
description This paper deals with automatic phonetic segmentation for Malay continuous speech. This study investigates fast and automatic phone segmentation in preparing database for Malay concatenative Text-to-Speech (TTS) systems. A 35 Malay phone set has been chosen, which is suitable for building Malay TTS. The segmentation experiment is based on this phone set. HMM based segmentation approach which uses Viterbi force alignment technique is adapted. We use continuous density HMM (CDHMM) with Gaussian mixture which is performs well in speech recognition to prevent large segmentation errors. Besides, this paper presents an implicit boundary refinement method that is incorporated in the Viterbi phonetic alignment. In this approach, the HMM model is trained with phone tokens with their boundaries extended to the be-side phones. This increases the ability of the HMM in modeling phone boundaries and provides effect of implicit boundary refinement when used in phonetic alignment thus reduce segmentation errors. This approach improves increase the performance of baseline HMM segmentation from 42.39%, 74.83%, 84.34% of automatic boundary marks within error smaller than 5, 15, and 25ms to 47.75%, 76.38%, 85.55%.
first_indexed 2024-03-05T18:11:35Z
format Conference or Workshop Item
id utm.eprints-7635
institution Universiti Teknologi Malaysia - ePrints
language English
last_indexed 2024-03-05T18:11:35Z
publishDate 2007
record_format dspace
spelling utm.eprints-76352010-06-01T15:54:10Z http://eprints.utm.my/7635/ Automatic phonetic segmentation of malay speech database Ting, Chee Ming Shaikh Salleh, Sheikh Hussain Tan, Tian Swee Ariff, Ahmad Kamarul TK Electrical engineering. Electronics Nuclear engineering This paper deals with automatic phonetic segmentation for Malay continuous speech. This study investigates fast and automatic phone segmentation in preparing database for Malay concatenative Text-to-Speech (TTS) systems. A 35 Malay phone set has been chosen, which is suitable for building Malay TTS. The segmentation experiment is based on this phone set. HMM based segmentation approach which uses Viterbi force alignment technique is adapted. We use continuous density HMM (CDHMM) with Gaussian mixture which is performs well in speech recognition to prevent large segmentation errors. Besides, this paper presents an implicit boundary refinement method that is incorporated in the Viterbi phonetic alignment. In this approach, the HMM model is trained with phone tokens with their boundaries extended to the be-side phones. This increases the ability of the HMM in modeling phone boundaries and provides effect of implicit boundary refinement when used in phonetic alignment thus reduce segmentation errors. This approach improves increase the performance of baseline HMM segmentation from 42.39%, 74.83%, 84.34% of automatic boundary marks within error smaller than 5, 15, and 25ms to 47.75%, 76.38%, 85.55%. 2007-12 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utm.my/7635/1/Sheikh_Hussain_Shaikh_2007_Automatic_Phonetic_Segmentation_of_Malay_Speech.pdf Ting, Chee Ming and Shaikh Salleh, Sheikh Hussain and Tan, Tian Swee and Ariff, Ahmad Kamarul (2007) Automatic phonetic segmentation of malay speech database. In: Information, Communications & Signal Processing, 2007 6th International Conference, 10-13 Dec 2007, Singapore. http://dx.doi.org/10.1109/ICICS.2007.4449574
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Ting, Chee Ming
Shaikh Salleh, Sheikh Hussain
Tan, Tian Swee
Ariff, Ahmad Kamarul
Automatic phonetic segmentation of malay speech database
title Automatic phonetic segmentation of malay speech database
title_full Automatic phonetic segmentation of malay speech database
title_fullStr Automatic phonetic segmentation of malay speech database
title_full_unstemmed Automatic phonetic segmentation of malay speech database
title_short Automatic phonetic segmentation of malay speech database
title_sort automatic phonetic segmentation of malay speech database
topic TK Electrical engineering. Electronics Nuclear engineering
url http://eprints.utm.my/7635/1/Sheikh_Hussain_Shaikh_2007_Automatic_Phonetic_Segmentation_of_Malay_Speech.pdf
work_keys_str_mv AT tingcheeming automaticphoneticsegmentationofmalayspeechdatabase
AT shaikhsallehsheikhhussain automaticphoneticsegmentationofmalayspeechdatabase
AT tantianswee automaticphoneticsegmentationofmalayspeechdatabase
AT ariffahmadkamarul automaticphoneticsegmentationofmalayspeechdatabase