Comparison of four classifiers for speech-music discrimination: a first case study for Costa Rican radio broadcasting

During the past decades, a vast amount of audio data has become available in most languages and regions of the world. The efficient organization and manipulation of this data are important for tasks such as data classification, searching for information, and diarization, among many others, but also ca...

Bibliographic Details
Main Authors: Joseline Sánchez-Solís, Marvin Coto-Jiménez
Format: Article
Language: Spanish
Published: Instituto Tecnológico de Costa Rica 2022-11-01
Series: Tecnología en Marcha
Subjects: Classification, music, radio broadcasting, speech
Online Access: https://172.20.14.50/index.php/tec_marcha/article/view/6463
_version_ 1827786545496588288
author Joseline Sánchez-Solís
Marvin Coto-Jiménez
author_facet Joseline Sánchez-Solís
Marvin Coto-Jiménez
author_sort Joseline Sánchez-Solís
collection DOAJ
description During the past decades, a vast amount of audio data has become available in most languages and regions of the world. The efficient organization and manipulation of this data are important for tasks such as data classification, information search, and diarization, among many others, and can also be relevant for building corpora to train automatic speech recognition models or to build speech synthesis systems. Several of those tasks require extensive testing and data for specific languages and accents, especially when the goal is to develop systems for communicating with machines. In this work, we explore the application of several classifiers to the task of discriminating speech and music in Costa Rican radio broadcasts. This discrimination is a first task in the exploration of a large corpus, to determine whether or not the available information is useful for particular research areas. The main contribution of this exploratory work is the general procedure and selection of algorithms for the Costa Rican radio corpus, which can lead to the extensive use of this source of data in many applications and systems of our own.
first_indexed 2024-03-11T16:36:47Z
format Article
id doaj.art-101ef310da5b4fe2acb12c110b79c9b9
institution Directory Open Access Journal
issn 0379-3982
2215-3241
language Spanish
last_indexed 2024-03-11T16:36:47Z
publishDate 2022-11-01
publisher Instituto Tecnológico de Costa Rica
record_format Article
series Tecnología en Marcha
spellingShingle Joseline Sánchez-Solís
Marvin Coto-Jiménez
Comparison of four classifiers for speech-music discrimination: a first case study for Costa Rican radio broadcasting
Tecnología en Marcha
Classification
music
radio broadcasting
speech
title Comparison of four classifiers for speech-music discrimination: a first case study for Costa Rican radio broadcasting
title_sort comparison of four classifiers for speech music discrimination a first case study for costa rican radio broadcasting
topic Classification
music
radio broadcasting
speech
url https://172.20.14.50/index.php/tec_marcha/article/view/6463
work_keys_str_mv AT joselinesanchezsolis comparisonoffourclassifiersforspeechmusicdiscriminationafirstcasestudyforcostaricanradiobroadcasting
AT marvincotojimenez comparisonoffourclassifiersforspeechmusicdiscriminationafirstcasestudyforcostaricanradiobroadcasting