Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning

Abstract Analyzing songs is a problem that is being investigated to aid various operations on music access platforms. At the beginning of these problems is the identification of the person who sings the song. In this study, a singer identification application, which consists of Turkish singers and w...

Full description

Bibliographic Details
Main Authors: Serhat Hizlisoy, Recep Sinan Arslan, Emel Çolakoğlu
Format: Article
Language:English
Published: SpringerOpen 2024-02-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Subjects:
Online Access:https://doi.org/10.1186/s13636-024-00336-8
_version_ 1797273695695142912
author Serhat Hizlisoy
Recep Sinan Arslan
Emel Çolakoğlu
author_facet Serhat Hizlisoy
Recep Sinan Arslan
Emel Çolakoğlu
author_sort Serhat Hizlisoy
collection DOAJ
description Abstract Analyzing songs is a problem that is being investigated to aid various operations on music access platforms. At the beginning of these problems is the identification of the person who sings the song. In this study, a singer identification application, which consists of Turkish singers and works for the Turkish language, is proposed in order to find a solution to this problem. Mel-spectrogram and octave-based spectral contrast values are extracted from the songs, and these values are combined into a hybrid feature vector. Thus, problem-specific situations such as determining the differences in the voices of the singers and reducing the effects of the year and album differences on the result are discussed. As a result of the tests and systematic evaluations, it has been shown that a certain level of success has been achieved in the determination of the singer who sings the song, and that the song is in a stable structure against the changes in the singing style and song structure. The results were analyzed in a database of 9 singers and 180 songs. An accuracy value of 89.4% was obtained using the reduction of the feature vector by PCA, the normalization of the data, and the Extra Trees classifier. Precision, recall and f-score values were 89.9%, 89.4% and 89.5%, respectively.
first_indexed 2024-03-07T14:47:58Z
format Article
id doaj.art-b70c3c3c102944b1a296485a33ed55c9
institution Directory Open Access Journal
issn 1687-4722
language English
last_indexed 2024-03-07T14:47:58Z
publishDate 2024-02-01
publisher SpringerOpen
record_format Article
series EURASIP Journal on Audio, Speech, and Music Processing
spelling doaj.art-b70c3c3c102944b1a296485a33ed55c92024-03-05T19:52:04ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47222024-02-012024111310.1186/s13636-024-00336-8Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learningSerhat Hizlisoy0Recep Sinan Arslan1Emel Çolakoğlu2Department of Computer Engineering, Faculty of Engineering, Architecture and Design, Kayseri UniversityDepartment of Computer Engineering, Faculty of Engineering, Architecture and Design, Kayseri UniversityCalculated Sciences and Engineering, Graduate School of Education, Kayseri UniversityAbstract Analyzing songs is a problem that is being investigated to aid various operations on music access platforms. At the beginning of these problems is the identification of the person who sings the song. In this study, a singer identification application, which consists of Turkish singers and works for the Turkish language, is proposed in order to find a solution to this problem. Mel-spectrogram and octave-based spectral contrast values are extracted from the songs, and these values are combined into a hybrid feature vector. Thus, problem-specific situations such as determining the differences in the voices of the singers and reducing the effects of the year and album differences on the result are discussed. As a result of the tests and systematic evaluations, it has been shown that a certain level of success has been achieved in the determination of the singer who sings the song, and that the song is in a stable structure against the changes in the singing style and song structure. The results were analyzed in a database of 9 singers and 180 songs. An accuracy value of 89.4% was obtained using the reduction of the feature vector by PCA, the normalization of the data, and the Extra Trees classifier. Precision, recall and f-score values were 89.9%, 89.4% and 89.5%, respectively.https://doi.org/10.1186/s13636-024-00336-8Singer identificationOctave-based spectral contrastMel-frequency cepstral coefficientsExtra Trees classifier
spellingShingle Serhat Hizlisoy
Recep Sinan Arslan
Emel Çolakoğlu
Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
EURASIP Journal on Audio, Speech, and Music Processing
Singer identification
Octave-based spectral contrast
Mel-frequency cepstral coefficients
Extra Trees classifier
title Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
title_full Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
title_fullStr Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
title_full_unstemmed Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
title_short Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
title_sort singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
topic Singer identification
Octave-based spectral contrast
Mel-frequency cepstral coefficients
Extra Trees classifier
url https://doi.org/10.1186/s13636-024-00336-8
work_keys_str_mv AT serhathizlisoy singeridentificationmodelusingdataaugmentationandenhancedfeatureconversionwithhybridfeaturevectorandmachinelearning
AT recepsinanarslan singeridentificationmodelusingdataaugmentationandenhancedfeatureconversionwithhybridfeaturevectorandmachinelearning
AT emelcolakoglu singeridentificationmodelusingdataaugmentationandenhancedfeatureconversionwithhybridfeaturevectorandmachinelearning