Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation Automations

The present paper focuses on adaptive audio detection, segmentation and classification techniques in audio broadcasting content, dedicated mainly to voice data. The suggested framework addresses a real case scenario encountered in media services and especially radio streams, aiming to fulfill divers...

Full description

Bibliographic Details
Main Authors:	Rigas Kotsakis, Charalampos Dimoulas
Format:	Article
Language:	English
Published:	MDPI AG 2022-07-01
Series:	Knowledge
Subjects:	audio semantics content analysis radio broadcasting
Online Access:	https://www.mdpi.com/2673-9585/2/3/20

_version_	1797464868526227456
author	Rigas Kotsakis Charalampos Dimoulas
author_facet	Rigas Kotsakis Charalampos Dimoulas
author_sort	Rigas Kotsakis
collection	DOAJ
description	The present paper focuses on adaptive audio detection, segmentation and classification techniques in audio broadcasting content, dedicated mainly to voice data. The suggested framework addresses a real case scenario encountered in media services and especially radio streams, aiming to fulfill diverse (semi-) automated indexing/annotation and management necessities. In this context, aggregated radio content is collected, featuring small input datasets, which are utilized for adaptive classification experiments, without searching, at this point, for a generic pattern recognition solution. Hierarchical and hybrid taxonomies are proposed, firstly to discriminate voice data in radio streams and thereafter to detect single speaker voices, and when this is the case, the experiments proceed into a final layer of gender classification. It is worth mentioning that stand-alone and combined supervised and clustering techniques are tested along with multivariate window tuning, towards the extraction of meaningful results based on overall and partial performance rates. Furthermore, the current work via data augmentation mechanisms contributes to the formulation of a dynamic Generic Audio Classification Repository to be subjected, in the future, to adaptive multilabel experimentation with more sophisticated techniques, such as deep architectures.
first_indexed	2024-03-09T18:14:16Z
format	Article
id	doaj.art-ee3b721d736144a7953d10805796227f
institution	Directory Open Access Journal
issn	2673-9585
language	English
last_indexed	2024-03-09T18:14:16Z
publishDate	2022-07-01
publisher	MDPI AG
record_format	Article
series	Knowledge
spelling	doaj.art-ee3b721d736144a7953d10805796227f2023-11-24T08:55:07ZengMDPI AGKnowledge2673-95852022-07-012334736410.3390/knowledge2030020Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation AutomationsRigas Kotsakis0Charalampos Dimoulas1Department of Information and Electronic Engineering, International Hellenic University, 57001 Thessaloniki, GreeceSchool of Journalism & Mass Communications, Aristotle University, 54124 Thessaloniki, GreeceThe present paper focuses on adaptive audio detection, segmentation and classification techniques in audio broadcasting content, dedicated mainly to voice data. The suggested framework addresses a real case scenario encountered in media services and especially radio streams, aiming to fulfill diverse (semi-) automated indexing/annotation and management necessities. In this context, aggregated radio content is collected, featuring small input datasets, which are utilized for adaptive classification experiments, without searching, at this point, for a generic pattern recognition solution. Hierarchical and hybrid taxonomies are proposed, firstly to discriminate voice data in radio streams and thereafter to detect single speaker voices, and when this is the case, the experiments proceed into a final layer of gender classification. It is worth mentioning that stand-alone and combined supervised and clustering techniques are tested along with multivariate window tuning, towards the extraction of meaningful results based on overall and partial performance rates. Furthermore, the current work via data augmentation mechanisms contributes to the formulation of a dynamic Generic Audio Classification Repository to be subjected, in the future, to adaptive multilabel experimentation with more sophisticated techniques, such as deep architectures.https://www.mdpi.com/2673-9585/2/3/20audio semanticscontent analysisradio broadcasting
spellingShingle	Rigas Kotsakis Charalampos Dimoulas Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation Automations Knowledge audio semantics content analysis radio broadcasting
title	Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation Automations
title_full	Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation Automations
title_fullStr	Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation Automations
title_full_unstemmed	Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation Automations
title_short	Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation Automations
title_sort	extending radio broadcasting semantics through adaptive audio segmentation automations
topic	audio semantics content analysis radio broadcasting
url	https://www.mdpi.com/2673-9585/2/3/20
work_keys_str_mv	AT rigaskotsakis extendingradiobroadcastingsemanticsthroughadaptiveaudiosegmentationautomations AT charalamposdimoulas extendingradiobroadcastingsemanticsthroughadaptiveaudiosegmentationautomations

Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation Automations

Similar Items