Burmese speech word recognition
The aim of this study is to develop a Burmese speech word recognition system. A phone level (sub-word) based voice-operated interface for phone dialing system is implemented using the Hidden Markov Model Tool Kit (HTK). The Mel Frequency Cepstral Coefficients (MFCCs) and Perceptual Linear Predictio...
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | English |
Published: |
2010
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/41421 |
_version_ | 1811689704965275648 |
---|---|
author | Zaw, Wai Phyo |
author2 | Soon Ing Yann |
author_facet | Soon Ing Yann Zaw, Wai Phyo |
author_sort | Zaw, Wai Phyo |
collection | NTU |
description | The aim of this study is to develop a Burmese speech word recognition system. A phone level (sub-word) based voice-operated interface for phone dialing system is implemented using the Hidden Markov Model Tool Kit (HTK). The Mel Frequency Cepstral
Coefficients (MFCCs) and Perceptual Linear Prediction Cepstral Coefficients (PLPCCs)
are employed as feature in this project. In this study, the method called Half-Wave
Rectification is applied to the MFCC to enhance the spectral peaks. The recognizer is
evaluated with MFCCs and PLPCCs as well as rectified MFCCs. The evaluated results
are discussed and finally, this study also provides the recommendation and future works
to extend this project. |
first_indexed | 2024-10-01T05:52:20Z |
format | Thesis |
id | ntu-10356/41421 |
institution | Nanyang Technological University |
language | English |
last_indexed | 2024-10-01T05:52:20Z |
publishDate | 2010 |
record_format | dspace |
spelling | ntu-10356/414212023-07-04T15:29:28Z Burmese speech word recognition Zaw, Wai Phyo Soon Ing Yann School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing The aim of this study is to develop a Burmese speech word recognition system. A phone level (sub-word) based voice-operated interface for phone dialing system is implemented using the Hidden Markov Model Tool Kit (HTK). The Mel Frequency Cepstral Coefficients (MFCCs) and Perceptual Linear Prediction Cepstral Coefficients (PLPCCs) are employed as feature in this project. In this study, the method called Half-Wave Rectification is applied to the MFCC to enhance the spectral peaks. The recognizer is evaluated with MFCCs and PLPCCs as well as rectified MFCCs. The evaluated results are discussed and finally, this study also provides the recommendation and future works to extend this project. Master of Science (Signal Processing) 2010-07-02T08:21:05Z 2010-07-02T08:21:05Z 2008 2008 Thesis http://hdl.handle.net/10356/41421 en 75 p. application/pdf |
spellingShingle | DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Zaw, Wai Phyo Burmese speech word recognition |
title | Burmese speech word recognition |
title_full | Burmese speech word recognition |
title_fullStr | Burmese speech word recognition |
title_full_unstemmed | Burmese speech word recognition |
title_short | Burmese speech word recognition |
title_sort | burmese speech word recognition |
topic | DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
url | http://hdl.handle.net/10356/41421 |
work_keys_str_mv | AT zawwaiphyo burmesespeechwordrecognition |