Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics
This paper proposes a technique for improving statistical-model-based voice activity detection (VAD) in noisy environments to be applied in an auditory hearing aid. The proposed method is implemented for a uniform polyphase discrete Fourier transform filter bank satisfying an auditory device time la...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-07-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/10/15/5026 |
_version_ | 1797561739386028032 |
---|---|
author | Seon Man Kim |
author_facet | Seon Man Kim |
author_sort | Seon Man Kim |
collection | DOAJ |
description | This paper proposes a technique for improving statistical-model-based voice activity detection (VAD) in noisy environments to be applied in an auditory hearing aid. The proposed method is implemented for a uniform polyphase discrete Fourier transform filter bank satisfying an auditory device time latency of 8 ms. The proposed VAD technique provides an online unified framework to overcome the frequent false rejection of the statistical-model-based likelihood-ratio test (LRT) in noisy environments. The method is based on the observation that the sparseness of speech and background noise cause high false-rejection error rates in statistical LRT-based VAD—the false rejection rate increases as the sparseness increases. We demonstrate that the false-rejection error rate can be reduced by incorporating likelihood-ratio order statistics into a conventional LRT VAD. We confirm experimentally that the proposed method relatively reduces the average detection error rate by 15.8% compared to a conventional VAD with only minimal change in the false acceptance probability for three different noise conditions whose signal-to-noise ratio ranges from 0 to 20 dB. |
first_indexed | 2024-03-10T18:18:57Z |
format | Article |
id | doaj.art-851f174a0fe149bda102c2776c0c5fd0 |
institution | Directory Open Access Journal |
issn | 2076-3417 |
language | English |
last_indexed | 2024-03-10T18:18:57Z |
publishDate | 2020-07-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj.art-851f174a0fe149bda102c2776c0c5fd02023-11-20T07:30:36ZengMDPI AGApplied Sciences2076-34172020-07-011015502610.3390/app10155026Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order StatisticsSeon Man Kim0Korea Photonics Technology Institute, Gwangju 61007, KoreaThis paper proposes a technique for improving statistical-model-based voice activity detection (VAD) in noisy environments to be applied in an auditory hearing aid. The proposed method is implemented for a uniform polyphase discrete Fourier transform filter bank satisfying an auditory device time latency of 8 ms. The proposed VAD technique provides an online unified framework to overcome the frequent false rejection of the statistical-model-based likelihood-ratio test (LRT) in noisy environments. The method is based on the observation that the sparseness of speech and background noise cause high false-rejection error rates in statistical LRT-based VAD—the false rejection rate increases as the sparseness increases. We demonstrate that the false-rejection error rate can be reduced by incorporating likelihood-ratio order statistics into a conventional LRT VAD. We confirm experimentally that the proposed method relatively reduces the average detection error rate by 15.8% compared to a conventional VAD with only minimal change in the false acceptance probability for three different noise conditions whose signal-to-noise ratio ranges from 0 to 20 dB.https://www.mdpi.com/2076-3417/10/15/5026voice activity detectionlikelihood-ratio testorder statisticsstatistical modelfalse rejectionauditory device |
spellingShingle | Seon Man Kim Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics Applied Sciences voice activity detection likelihood-ratio test order statistics statistical model false rejection auditory device |
title | Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics |
title_full | Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics |
title_fullStr | Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics |
title_full_unstemmed | Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics |
title_short | Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics |
title_sort | auditory device voice activity detection based on statistical likelihood ratio order statistics |
topic | voice activity detection likelihood-ratio test order statistics statistical model false rejection auditory device |
url | https://www.mdpi.com/2076-3417/10/15/5026 |
work_keys_str_mv | AT seonmankim auditorydevicevoiceactivitydetectionbasedonstatisticallikelihoodratioorderstatistics |