Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning

Abstract Analyzing songs is a problem that is being investigated to aid various operations on music access platforms. At the beginning of these problems is the identification of the person who sings the song. In this study, a singer identification application, which consists of Turkish singers and w...

Full description

Bibliographic Details
Main Authors:	Serhat Hizlisoy, Recep Sinan Arslan, Emel Çolakoğlu
Format:	Article
Language:	English
Published:	SpringerOpen 2024-02-01
Series:	EURASIP Journal on Audio, Speech, and Music Processing
Subjects:	Singer identification Octave-based spectral contrast Mel-frequency cepstral coefficients Extra Trees classifier
Online Access:	https://doi.org/10.1186/s13636-024-00336-8

_version_	1797273695695142912
author	Serhat Hizlisoy Recep Sinan Arslan Emel Çolakoğlu
author_facet	Serhat Hizlisoy Recep Sinan Arslan Emel Çolakoğlu
author_sort	Serhat Hizlisoy
collection	DOAJ
description	Abstract Analyzing songs is a problem that is being investigated to aid various operations on music access platforms. At the beginning of these problems is the identification of the person who sings the song. In this study, a singer identification application, which consists of Turkish singers and works for the Turkish language, is proposed in order to find a solution to this problem. Mel-spectrogram and octave-based spectral contrast values are extracted from the songs, and these values are combined into a hybrid feature vector. Thus, problem-specific situations such as determining the differences in the voices of the singers and reducing the effects of the year and album differences on the result are discussed. As a result of the tests and systematic evaluations, it has been shown that a certain level of success has been achieved in the determination of the singer who sings the song, and that the song is in a stable structure against the changes in the singing style and song structure. The results were analyzed in a database of 9 singers and 180 songs. An accuracy value of 89.4% was obtained using the reduction of the feature vector by PCA, the normalization of the data, and the Extra Trees classifier. Precision, recall and f-score values were 89.9%, 89.4% and 89.5%, respectively.
first_indexed	2024-03-07T14:47:58Z
format	Article
id	doaj.art-b70c3c3c102944b1a296485a33ed55c9
institution	Directory Open Access Journal
issn	1687-4722
language	English
last_indexed	2024-03-07T14:47:58Z
publishDate	2024-02-01
publisher	SpringerOpen
record_format	Article
series	EURASIP Journal on Audio, Speech, and Music Processing
spelling	doaj.art-b70c3c3c102944b1a296485a33ed55c92024-03-05T19:52:04ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47222024-02-012024111310.1186/s13636-024-00336-8Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learningSerhat Hizlisoy0Recep Sinan Arslan1Emel Çolakoğlu2Department of Computer Engineering, Faculty of Engineering, Architecture and Design, Kayseri UniversityDepartment of Computer Engineering, Faculty of Engineering, Architecture and Design, Kayseri UniversityCalculated Sciences and Engineering, Graduate School of Education, Kayseri UniversityAbstract Analyzing songs is a problem that is being investigated to aid various operations on music access platforms. At the beginning of these problems is the identification of the person who sings the song. In this study, a singer identification application, which consists of Turkish singers and works for the Turkish language, is proposed in order to find a solution to this problem. Mel-spectrogram and octave-based spectral contrast values are extracted from the songs, and these values are combined into a hybrid feature vector. Thus, problem-specific situations such as determining the differences in the voices of the singers and reducing the effects of the year and album differences on the result are discussed. As a result of the tests and systematic evaluations, it has been shown that a certain level of success has been achieved in the determination of the singer who sings the song, and that the song is in a stable structure against the changes in the singing style and song structure. The results were analyzed in a database of 9 singers and 180 songs. An accuracy value of 89.4% was obtained using the reduction of the feature vector by PCA, the normalization of the data, and the Extra Trees classifier. Precision, recall and f-score values were 89.9%, 89.4% and 89.5%, respectively.https://doi.org/10.1186/s13636-024-00336-8Singer identificationOctave-based spectral contrastMel-frequency cepstral coefficientsExtra Trees classifier
spellingShingle	Serhat Hizlisoy Recep Sinan Arslan Emel Çolakoğlu Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning EURASIP Journal on Audio, Speech, and Music Processing Singer identification Octave-based spectral contrast Mel-frequency cepstral coefficients Extra Trees classifier
title	Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
title_full	Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
title_fullStr	Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
title_full_unstemmed	Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
title_short	Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
title_sort	singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning
topic	Singer identification Octave-based spectral contrast Mel-frequency cepstral coefficients Extra Trees classifier
url	https://doi.org/10.1186/s13636-024-00336-8
work_keys_str_mv	AT serhathizlisoy singeridentificationmodelusingdataaugmentationandenhancedfeatureconversionwithhybridfeaturevectorandmachinelearning AT recepsinanarslan singeridentificationmodelusingdataaugmentationandenhancedfeatureconversionwithhybridfeaturevectorandmachinelearning AT emelcolakoglu singeridentificationmodelusingdataaugmentationandenhancedfeatureconversionwithhybridfeaturevectorandmachinelearning

Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning

Similar Items