Spectral local harmonicity feature for voice activity detection

In this paper, we propose a method to exploit the harmonicity of human voiced speech using only the most harmonic sub-part of the spectrum. This technique searches for all the potential sub-windows of the spectrum, and measures their local harmonicity, using a newly proposed metric, which works in t...

Full description

Bibliographic Details
Main Authors: Khoa, Pham Chau, Siong, Chng Eng
Other Authors: School of Computer Engineering
Format: Conference Paper
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/101329
http://hdl.handle.net/10220/16271
_version_ 1811682065656053760
author Khoa, Pham Chau
Siong, Chng Eng
author2 School of Computer Engineering
author_facet School of Computer Engineering
Khoa, Pham Chau
Siong, Chng Eng
author_sort Khoa, Pham Chau
collection NTU
description In this paper, we propose a method to exploit the harmonicity of human voiced speech using only the most harmonic sub-part of the spectrum. This technique searches for all the potential sub-windows of the spectrum, and measures their local harmonicity, using a newly proposed metric, which works in the spectral autocorrelation domain and employs a novel sinusoidal fitting approach. Experiments show that the new feature can be used to detect noisy voiced speech frames heavily corrupted by non-stationary noise even at 0dB SNR with high precision and recall, which gives better results than the Windowed Autocorrelation Lag Energy (WALE), a recently proposed voicing features, under a complex factory noise scenarios.
first_indexed 2024-10-01T03:50:55Z
format Conference Paper
id ntu-10356/101329
institution Nanyang Technological University
language English
last_indexed 2024-10-01T03:50:55Z
publishDate 2013
record_format dspace
spelling ntu-10356/1013292020-05-28T07:19:14Z Spectral local harmonicity feature for voice activity detection Khoa, Pham Chau Siong, Chng Eng School of Computer Engineering International Conference on Audio, Language and Image Processing (2012 : Shanghai, China) Temasek Laboratories DRNTU::Engineering::Computer science and engineering In this paper, we propose a method to exploit the harmonicity of human voiced speech using only the most harmonic sub-part of the spectrum. This technique searches for all the potential sub-windows of the spectrum, and measures their local harmonicity, using a newly proposed metric, which works in the spectral autocorrelation domain and employs a novel sinusoidal fitting approach. Experiments show that the new feature can be used to detect noisy voiced speech frames heavily corrupted by non-stationary noise even at 0dB SNR with high precision and recall, which gives better results than the Windowed Autocorrelation Lag Energy (WALE), a recently proposed voicing features, under a complex factory noise scenarios. 2013-10-04T06:29:46Z 2019-12-06T20:36:48Z 2013-10-04T06:29:46Z 2019-12-06T20:36:48Z 2012 2012 Conference Paper Khoa, P. C., & Siong, C. E. (2012). Spectral local harmonicity feature for voice activity detection. 2012 International Conference on Audio, Language and Image Processing (ICALIP). https://hdl.handle.net/10356/101329 http://hdl.handle.net/10220/16271 10.1109/ICALIP.2012.6376652 en
spellingShingle DRNTU::Engineering::Computer science and engineering
Khoa, Pham Chau
Siong, Chng Eng
Spectral local harmonicity feature for voice activity detection
title Spectral local harmonicity feature for voice activity detection
title_full Spectral local harmonicity feature for voice activity detection
title_fullStr Spectral local harmonicity feature for voice activity detection
title_full_unstemmed Spectral local harmonicity feature for voice activity detection
title_short Spectral local harmonicity feature for voice activity detection
title_sort spectral local harmonicity feature for voice activity detection
topic DRNTU::Engineering::Computer science and engineering
url https://hdl.handle.net/10356/101329
http://hdl.handle.net/10220/16271
work_keys_str_mv AT khoaphamchau spectrallocalharmonicityfeatureforvoiceactivitydetection
AT siongchngeng spectrallocalharmonicityfeatureforvoiceactivitydetection