Single-ended quality measurement of a music content via convolutional recurrent neural networks

The paper examines the usage of Convolutional Bidirectional Recurrent Neural Network (CBRNN) for a problem of quality measurement in a music content. The key contribution in this approach, compared to the existing research, is that the examined model is evaluated in terms of detecting acoustic anoma...

Full description

Bibliographic Details
Main Authors: Kamila Organiściak, Józef Borkowski
Format: Article
Language:English
Published: Polish Academy of Sciences 2021-01-01
Series:Metrology and Measurement Systems
Subjects:
Online Access:https://journals.pan.pl/Content/117865/PDF/art12.pdf
_version_ 1818491098909114368
author Kamila Organiściak
Józef Borkowski
author_facet Kamila Organiściak
Józef Borkowski
author_sort Kamila Organiściak
collection DOAJ
description The paper examines the usage of Convolutional Bidirectional Recurrent Neural Network (CBRNN) for a problem of quality measurement in a music content. The key contribution in this approach, compared to the existing research, is that the examined model is evaluated in terms of detecting acoustic anomalies without the requirement to provide a reference (clean) signal. Since real music content may include some modes of instrumental sounds, speech and singing voice or different audio effects, it is more complex to analyze than clean speech or artificial signals, especially without a comparison to the known reference content. The presented results might be treated as a proof of concept, since some specific types of artefacts are covered in this paper (examples of quantization defect, missing sound, distortion of gain characteristics, extra noise sound). However, the described model can be easily expanded to detect other impairments or used as a pre-trained model for other transfer learning processes. To examine the model efficiency several experiments have been performed and reported in the paper. The raw audio samples were transformed into Mel-scaled spectrograms and transferred as input to the model, first independently, then along with additional features (Zero Crossing Rate, Spectral Contrast). According to the obtained results, there is a significant increase in overall accuracy (by 10.1%), if Spectral Contrast information is provided together with Mel-scaled spectrograms. The paper examines also the influence of recursive layers on effectiveness of the artefact classification task.
first_indexed 2024-12-10T17:25:49Z
format Article
id doaj.art-be9835fb8b6a43e8ae23c72b165cad5c
institution Directory Open Access Journal
issn 2300-1941
language English
last_indexed 2024-12-10T17:25:49Z
publishDate 2021-01-01
publisher Polish Academy of Sciences
record_format Article
series Metrology and Measurement Systems
spelling doaj.art-be9835fb8b6a43e8ae23c72b165cad5c2022-12-22T01:39:51ZengPolish Academy of SciencesMetrology and Measurement Systems2300-19412021-01-01vol. 27No 4721733https://doi.org/10.24425/mms.2020.134849Single-ended quality measurement of a music content via convolutional recurrent neural networksKamila OrganiściakJózef BorkowskiThe paper examines the usage of Convolutional Bidirectional Recurrent Neural Network (CBRNN) for a problem of quality measurement in a music content. The key contribution in this approach, compared to the existing research, is that the examined model is evaluated in terms of detecting acoustic anomalies without the requirement to provide a reference (clean) signal. Since real music content may include some modes of instrumental sounds, speech and singing voice or different audio effects, it is more complex to analyze than clean speech or artificial signals, especially without a comparison to the known reference content. The presented results might be treated as a proof of concept, since some specific types of artefacts are covered in this paper (examples of quantization defect, missing sound, distortion of gain characteristics, extra noise sound). However, the described model can be easily expanded to detect other impairments or used as a pre-trained model for other transfer learning processes. To examine the model efficiency several experiments have been performed and reported in the paper. The raw audio samples were transformed into Mel-scaled spectrograms and transferred as input to the model, first independently, then along with additional features (Zero Crossing Rate, Spectral Contrast). According to the obtained results, there is a significant increase in overall accuracy (by 10.1%), if Spectral Contrast information is provided together with Mel-scaled spectrograms. The paper examines also the influence of recursive layers on effectiveness of the artefact classification task.https://journals.pan.pl/Content/117865/PDF/art12.pdfaudio data analysisartefacts detectionconvolutional neural networksrecurrent neural networksclassification model
spellingShingle Kamila Organiściak
Józef Borkowski
Single-ended quality measurement of a music content via convolutional recurrent neural networks
Metrology and Measurement Systems
audio data analysis
artefacts detection
convolutional neural networks
recurrent neural networks
classification model
title Single-ended quality measurement of a music content via convolutional recurrent neural networks
title_full Single-ended quality measurement of a music content via convolutional recurrent neural networks
title_fullStr Single-ended quality measurement of a music content via convolutional recurrent neural networks
title_full_unstemmed Single-ended quality measurement of a music content via convolutional recurrent neural networks
title_short Single-ended quality measurement of a music content via convolutional recurrent neural networks
title_sort single ended quality measurement of a music content via convolutional recurrent neural networks
topic audio data analysis
artefacts detection
convolutional neural networks
recurrent neural networks
classification model
url https://journals.pan.pl/Content/117865/PDF/art12.pdf
work_keys_str_mv AT kamilaorganisciak singleendedqualitymeasurementofamusiccontentviaconvolutionalrecurrentneuralnetworks
AT jozefborkowski singleendedqualitymeasurementofamusiccontentviaconvolutionalrecurrentneuralnetworks