Single-ended quality measurement of a music content via convolutional recurrent neural networks
The paper examines the usage of Convolutional Bidirectional Recurrent Neural Network (CBRNN) for a problem of quality measurement in a music content. The key contribution in this approach, compared to the existing research, is that the examined model is evaluated in terms of detecting acoustic anoma...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Polish Academy of Sciences
2021-01-01
|
Series: | Metrology and Measurement Systems |
Subjects: | |
Online Access: | https://journals.pan.pl/Content/117865/PDF/art12.pdf |
_version_ | 1818491098909114368 |
---|---|
author | Kamila Organiściak Józef Borkowski |
author_facet | Kamila Organiściak Józef Borkowski |
author_sort | Kamila Organiściak |
collection | DOAJ |
description | The paper examines the usage of Convolutional Bidirectional Recurrent Neural Network (CBRNN) for a problem of quality measurement in a music content. The key contribution in this approach, compared to the existing research, is that the examined model is evaluated in terms of detecting acoustic anomalies without the requirement to provide a reference (clean) signal. Since real music content may include some modes of instrumental sounds, speech and singing voice or different audio effects, it is more complex to analyze than clean speech or artificial signals, especially without a comparison to the known reference content. The presented results might be treated as a proof of concept, since some specific types of artefacts are covered in this paper (examples of quantization defect, missing sound, distortion of gain characteristics, extra noise sound). However, the described model can be easily expanded to detect other impairments or used as a pre-trained model for other transfer learning processes. To examine the model efficiency several experiments have been performed and reported in the paper. The raw audio samples were transformed into Mel-scaled spectrograms and transferred as input to the model, first independently, then along with additional features (Zero Crossing Rate, Spectral Contrast). According to the obtained results, there is a significant increase in overall accuracy (by 10.1%), if Spectral Contrast information is provided together with Mel-scaled spectrograms. The paper examines also the influence of recursive layers on effectiveness of the artefact classification task. |
first_indexed | 2024-12-10T17:25:49Z |
format | Article |
id | doaj.art-be9835fb8b6a43e8ae23c72b165cad5c |
institution | Directory Open Access Journal |
issn | 2300-1941 |
language | English |
last_indexed | 2024-12-10T17:25:49Z |
publishDate | 2021-01-01 |
publisher | Polish Academy of Sciences |
record_format | Article |
series | Metrology and Measurement Systems |
spelling | doaj.art-be9835fb8b6a43e8ae23c72b165cad5c2022-12-22T01:39:51ZengPolish Academy of SciencesMetrology and Measurement Systems2300-19412021-01-01vol. 27No 4721733https://doi.org/10.24425/mms.2020.134849Single-ended quality measurement of a music content via convolutional recurrent neural networksKamila OrganiściakJózef BorkowskiThe paper examines the usage of Convolutional Bidirectional Recurrent Neural Network (CBRNN) for a problem of quality measurement in a music content. The key contribution in this approach, compared to the existing research, is that the examined model is evaluated in terms of detecting acoustic anomalies without the requirement to provide a reference (clean) signal. Since real music content may include some modes of instrumental sounds, speech and singing voice or different audio effects, it is more complex to analyze than clean speech or artificial signals, especially without a comparison to the known reference content. The presented results might be treated as a proof of concept, since some specific types of artefacts are covered in this paper (examples of quantization defect, missing sound, distortion of gain characteristics, extra noise sound). However, the described model can be easily expanded to detect other impairments or used as a pre-trained model for other transfer learning processes. To examine the model efficiency several experiments have been performed and reported in the paper. The raw audio samples were transformed into Mel-scaled spectrograms and transferred as input to the model, first independently, then along with additional features (Zero Crossing Rate, Spectral Contrast). According to the obtained results, there is a significant increase in overall accuracy (by 10.1%), if Spectral Contrast information is provided together with Mel-scaled spectrograms. The paper examines also the influence of recursive layers on effectiveness of the artefact classification task.https://journals.pan.pl/Content/117865/PDF/art12.pdfaudio data analysisartefacts detectionconvolutional neural networksrecurrent neural networksclassification model |
spellingShingle | Kamila Organiściak Józef Borkowski Single-ended quality measurement of a music content via convolutional recurrent neural networks Metrology and Measurement Systems audio data analysis artefacts detection convolutional neural networks recurrent neural networks classification model |
title | Single-ended quality measurement of a music content via convolutional recurrent neural networks |
title_full | Single-ended quality measurement of a music content via convolutional recurrent neural networks |
title_fullStr | Single-ended quality measurement of a music content via convolutional recurrent neural networks |
title_full_unstemmed | Single-ended quality measurement of a music content via convolutional recurrent neural networks |
title_short | Single-ended quality measurement of a music content via convolutional recurrent neural networks |
title_sort | single ended quality measurement of a music content via convolutional recurrent neural networks |
topic | audio data analysis artefacts detection convolutional neural networks recurrent neural networks classification model |
url | https://journals.pan.pl/Content/117865/PDF/art12.pdf |
work_keys_str_mv | AT kamilaorganisciak singleendedqualitymeasurementofamusiccontentviaconvolutionalrecurrentneuralnetworks AT jozefborkowski singleendedqualitymeasurementofamusiccontentviaconvolutionalrecurrentneuralnetworks |