Voice Activity Detection Optimized by Adaptive Attention Span Transformer

Voice Activity Detection (VAD) is a widely used technique for separating vocal regions from audio signals, with applications in voice language coding, noise reduction, and other domains. While various strategies have been proposed to improve VAD performance, such as ACAM, DCU-10, and Tr-VAD, these a...

Full description

Bibliographic Details
Main Authors: Wenpeng Mu, Bingshan Liu
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10083136/