Burmese speech word recognition

The aim of this study is to develop a Burmese speech word recognition system. A phone level (sub-word) based voice-operated interface for phone dialing system is implemented using the Hidden Markov Model Tool Kit (HTK). The Mel Frequency Cepstral Coefficients (MFCCs) and Perceptual Linear Predictio...

Full description

Bibliographic Details
Main Author: Zaw, Wai Phyo
Other Authors: Soon Ing Yann
Format: Thesis
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/41421
_version_ 1811689704965275648
author Zaw, Wai Phyo
author2 Soon Ing Yann
author_facet Soon Ing Yann
Zaw, Wai Phyo
author_sort Zaw, Wai Phyo
collection NTU
description The aim of this study is to develop a Burmese speech word recognition system. A phone level (sub-word) based voice-operated interface for phone dialing system is implemented using the Hidden Markov Model Tool Kit (HTK). The Mel Frequency Cepstral Coefficients (MFCCs) and Perceptual Linear Prediction Cepstral Coefficients (PLPCCs) are employed as feature in this project. In this study, the method called Half-Wave Rectification is applied to the MFCC to enhance the spectral peaks. The recognizer is evaluated with MFCCs and PLPCCs as well as rectified MFCCs. The evaluated results are discussed and finally, this study also provides the recommendation and future works to extend this project.
first_indexed 2024-10-01T05:52:20Z
format Thesis
id ntu-10356/41421
institution Nanyang Technological University
language English
last_indexed 2024-10-01T05:52:20Z
publishDate 2010
record_format dspace
spelling ntu-10356/414212023-07-04T15:29:28Z Burmese speech word recognition Zaw, Wai Phyo Soon Ing Yann School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing The aim of this study is to develop a Burmese speech word recognition system. A phone level (sub-word) based voice-operated interface for phone dialing system is implemented using the Hidden Markov Model Tool Kit (HTK). The Mel Frequency Cepstral Coefficients (MFCCs) and Perceptual Linear Prediction Cepstral Coefficients (PLPCCs) are employed as feature in this project. In this study, the method called Half-Wave Rectification is applied to the MFCC to enhance the spectral peaks. The recognizer is evaluated with MFCCs and PLPCCs as well as rectified MFCCs. The evaluated results are discussed and finally, this study also provides the recommendation and future works to extend this project. Master of Science (Signal Processing) 2010-07-02T08:21:05Z 2010-07-02T08:21:05Z 2008 2008 Thesis http://hdl.handle.net/10356/41421 en 75 p. application/pdf
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Zaw, Wai Phyo
Burmese speech word recognition
title Burmese speech word recognition
title_full Burmese speech word recognition
title_fullStr Burmese speech word recognition
title_full_unstemmed Burmese speech word recognition
title_short Burmese speech word recognition
title_sort burmese speech word recognition
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
url http://hdl.handle.net/10356/41421
work_keys_str_mv AT zawwaiphyo burmesespeechwordrecognition