A smart eavesdropping system : recognizing keywords by human subjects

Speech Recognition (SR) gains its popularity in research area as the advance of modern technologies. It can translate speech into text with the aid of computers and speech recognition applications. In this final year project, an open source speech recognition engine named Pocketsphinx from Carnegie...

Full description

Bibliographic Details
Main Author: Zhang, Chun Meng
Other Authors: Khong Andy Wai Hoong
Format: Final Year Project (FYP)
Language:English
Published: 2015
Subjects:
Online Access:http://hdl.handle.net/10356/63768
_version_ 1826118967181180928
author Zhang, Chun Meng
author2 Khong Andy Wai Hoong
author_facet Khong Andy Wai Hoong
Zhang, Chun Meng
author_sort Zhang, Chun Meng
collection NTU
description Speech Recognition (SR) gains its popularity in research area as the advance of modern technologies. It can translate speech into text with the aid of computers and speech recognition applications. In this final year project, an open source speech recognition engine named Pocketsphinx from Carnegie Mellon University (CMU) is integrated into our existing Eavesdropping System to perform speech recognition or keyword spotting tasks on the output audio files from the system. This report covers the details on the development of whole speech recognition framework assembled. Pocketsphinx is compiled and installed in a Linux server remotely and communicates with the client Matlab programs using network sockets. System parameters are carefully tuned to ensure the performance. Experiments on different combinations of Acoustic Models (AM) and Language Models (LM) are also conducted and evaluated. Acoustic model adaptation which adapts the speech recognizer into specific acoustic environment or speaker to enhance the recognition performance is also presented. Furthermore, another commercially available speech recognition application named Dragon Naturally Speaking (DNS) 12 is also experimented and compared with Pocketsphinx used in the system.
first_indexed 2024-10-01T04:52:13Z
format Final Year Project (FYP)
id ntu-10356/63768
institution Nanyang Technological University
language English
last_indexed 2024-10-01T04:52:13Z
publishDate 2015
record_format dspace
spelling ntu-10356/637682023-07-07T16:50:36Z A smart eavesdropping system : recognizing keywords by human subjects Zhang, Chun Meng Khong Andy Wai Hoong School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Speech Recognition (SR) gains its popularity in research area as the advance of modern technologies. It can translate speech into text with the aid of computers and speech recognition applications. In this final year project, an open source speech recognition engine named Pocketsphinx from Carnegie Mellon University (CMU) is integrated into our existing Eavesdropping System to perform speech recognition or keyword spotting tasks on the output audio files from the system. This report covers the details on the development of whole speech recognition framework assembled. Pocketsphinx is compiled and installed in a Linux server remotely and communicates with the client Matlab programs using network sockets. System parameters are carefully tuned to ensure the performance. Experiments on different combinations of Acoustic Models (AM) and Language Models (LM) are also conducted and evaluated. Acoustic model adaptation which adapts the speech recognizer into specific acoustic environment or speaker to enhance the recognition performance is also presented. Furthermore, another commercially available speech recognition application named Dragon Naturally Speaking (DNS) 12 is also experimented and compared with Pocketsphinx used in the system. Bachelor of Engineering 2015-05-19T02:26:48Z 2015-05-19T02:26:48Z 2015 2015 Final Year Project (FYP) http://hdl.handle.net/10356/63768 en Nanyang Technological University 62 p. application/pdf
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Zhang, Chun Meng
A smart eavesdropping system : recognizing keywords by human subjects
title A smart eavesdropping system : recognizing keywords by human subjects
title_full A smart eavesdropping system : recognizing keywords by human subjects
title_fullStr A smart eavesdropping system : recognizing keywords by human subjects
title_full_unstemmed A smart eavesdropping system : recognizing keywords by human subjects
title_short A smart eavesdropping system : recognizing keywords by human subjects
title_sort smart eavesdropping system recognizing keywords by human subjects
topic DRNTU::Engineering::Electrical and electronic engineering
url http://hdl.handle.net/10356/63768
work_keys_str_mv AT zhangchunmeng asmarteavesdroppingsystemrecognizingkeywordsbyhumansubjects
AT zhangchunmeng smarteavesdroppingsystemrecognizingkeywordsbyhumansubjects