A large TV dataset for speech and music activity detection

Abstract Automatic speech and music activity detection (SMAD) is an enabling task that can help segment, index, and pre-process audio content in radio broadcast and TV programs. However, due to copyright concerns and the cost of manual annotation, the limited availability of diverse and sizeable dat...

Full description

Bibliographic Details
Main Authors: Yun-Ning Hung, Chih-Wei Wu, Iroro Orife, Aaron Hipple, William Wolcott, Alexander Lerch
Format: Article
Language:English
Published: SpringerOpen 2022-09-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Subjects:
Online Access:https://doi.org/10.1186/s13636-022-00253-8