Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics

This paper proposes a technique for improving statistical-model-based voice activity detection (VAD) in noisy environments to be applied in an auditory hearing aid. The proposed method is implemented for a uniform polyphase discrete Fourier transform filter bank satisfying an auditory device time la...

Full description

Bibliographic Details
Main Author: Seon Man Kim
Format: Article
Language:English
Published: MDPI AG 2020-07-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/10/15/5026
_version_ 1797561739386028032
author Seon Man Kim
author_facet Seon Man Kim
author_sort Seon Man Kim
collection DOAJ
description This paper proposes a technique for improving statistical-model-based voice activity detection (VAD) in noisy environments to be applied in an auditory hearing aid. The proposed method is implemented for a uniform polyphase discrete Fourier transform filter bank satisfying an auditory device time latency of 8 ms. The proposed VAD technique provides an online unified framework to overcome the frequent false rejection of the statistical-model-based likelihood-ratio test (LRT) in noisy environments. The method is based on the observation that the sparseness of speech and background noise cause high false-rejection error rates in statistical LRT-based VAD—the false rejection rate increases as the sparseness increases. We demonstrate that the false-rejection error rate can be reduced by incorporating likelihood-ratio order statistics into a conventional LRT VAD. We confirm experimentally that the proposed method relatively reduces the average detection error rate by 15.8% compared to a conventional VAD with only minimal change in the false acceptance probability for three different noise conditions whose signal-to-noise ratio ranges from 0 to 20 dB.
first_indexed 2024-03-10T18:18:57Z
format Article
id doaj.art-851f174a0fe149bda102c2776c0c5fd0
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T18:18:57Z
publishDate 2020-07-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-851f174a0fe149bda102c2776c0c5fd02023-11-20T07:30:36ZengMDPI AGApplied Sciences2076-34172020-07-011015502610.3390/app10155026Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order StatisticsSeon Man Kim0Korea Photonics Technology Institute, Gwangju 61007, KoreaThis paper proposes a technique for improving statistical-model-based voice activity detection (VAD) in noisy environments to be applied in an auditory hearing aid. The proposed method is implemented for a uniform polyphase discrete Fourier transform filter bank satisfying an auditory device time latency of 8 ms. The proposed VAD technique provides an online unified framework to overcome the frequent false rejection of the statistical-model-based likelihood-ratio test (LRT) in noisy environments. The method is based on the observation that the sparseness of speech and background noise cause high false-rejection error rates in statistical LRT-based VAD—the false rejection rate increases as the sparseness increases. We demonstrate that the false-rejection error rate can be reduced by incorporating likelihood-ratio order statistics into a conventional LRT VAD. We confirm experimentally that the proposed method relatively reduces the average detection error rate by 15.8% compared to a conventional VAD with only minimal change in the false acceptance probability for three different noise conditions whose signal-to-noise ratio ranges from 0 to 20 dB.https://www.mdpi.com/2076-3417/10/15/5026voice activity detectionlikelihood-ratio testorder statisticsstatistical modelfalse rejectionauditory device
spellingShingle Seon Man Kim
Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics
Applied Sciences
voice activity detection
likelihood-ratio test
order statistics
statistical model
false rejection
auditory device
title Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics
title_full Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics
title_fullStr Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics
title_full_unstemmed Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics
title_short Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics
title_sort auditory device voice activity detection based on statistical likelihood ratio order statistics
topic voice activity detection
likelihood-ratio test
order statistics
statistical model
false rejection
auditory device
url https://www.mdpi.com/2076-3417/10/15/5026
work_keys_str_mv AT seonmankim auditorydevicevoiceactivitydetectionbasedonstatisticallikelihoodratioorderstatistics