Multiclass audio segmentation based on recurrent neural networks for broadcast domain data
Abstract This paper presents a new approach based on recurrent neural networks (RNN) to the multiclass audio segmentation task whose goal is to classify an audio signal as speech, music, noise or a combination of these. The proposed system is based on the use of bidirectional long short-term Memory...
Main Authors: | Pablo Gimeno, Ignacio Viñals, Alfonso Ortega, Antonio Miguel, Eduardo Lleida |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2020-03-01
|
Series: | EURASIP Journal on Audio, Speech, and Music Processing |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s13636-020-00172-6 |
Similar Items
-
Digital audio broadcasting : principles and applications of DAB, DAB+ and DMB /
by: Hoeg, Wolfgang, et al.
Published: (2009) -
Extending Radio Broadcasting Semantics through Adaptive Audio Segmentation Automations
by: Rigas Kotsakis, et al.
Published: (2022-07-01) -
Practical Study of Recurrent Neural Networks for Efficient Real-Time Drone Sound Detection: A Review
by: Dana Utebayeva, et al.
Published: (2022-12-01) -
Deep Learning Approach for Detecting Audio Deepfakes in Urdu
by: Marium Mateen
Published: (2023-07-01) -
Audio-Visual Stress Classification Using Cascaded RNN-LSTM Networks
by: Megha V. Gupta, et al.
Published: (2022-09-01)