Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine Learning

Musical timbre is a phenomenon of auditory perception that allows the recognition of musical sounds. The recognition of musical timbre is a challenging task because the timbre of a musical instrument or sound source is a complex and multifaceted phenomenon that is influenced by a variety of factors,...

Full description

Bibliographic Details
Main Authors: Yubiry Gonzalez, Ronaldo C. Prati
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Series:Eng
Subjects:
Online Access:https://www.mdpi.com/2673-4117/4/1/33
_version_ 1827749999834824704
author Yubiry Gonzalez
Ronaldo C. Prati
author_facet Yubiry Gonzalez
Ronaldo C. Prati
author_sort Yubiry Gonzalez
collection DOAJ
description Musical timbre is a phenomenon of auditory perception that allows the recognition of musical sounds. The recognition of musical timbre is a challenging task because the timbre of a musical instrument or sound source is a complex and multifaceted phenomenon that is influenced by a variety of factors, including the physical properties of the instrument or sound source, the way it is played or produced, and the recording and processing techniques used. In this paper, we explore an abstract space with 7 dimensions formed by the fundamental frequency and FFT-Acoustic Descriptors in 240 monophonic sounds from the Tinysol and Good-Sounds databases, corresponding to the fourth octave of the transverse flute and clarinet. This approach allows us to unequivocally define a collection of points and, therefore, a timbral space (Category Theory) that allows different sounds of any type of musical instrument with its respective dynamics to be represented as a single characteristic vector. The geometric distance would allow studying the timbral similarity between audios of different sounds and instruments or between different musical dynamics and datasets. Additionally, a Machine-Learning algorithm that evaluates timbral similarities through Euclidean distances in the abstract space of 7 dimensions was proposed. We conclude that the study of timbral similarity through geometric distances allowed us to distinguish between audio categories of different sounds and musical instruments, between the same type of sound and an instrument with different relative dynamics, and between different datasets.
first_indexed 2024-03-11T06:36:25Z
format Article
id doaj.art-f461a82f13794e8e8191d4c4fbb4fb5e
institution Directory Open Access Journal
issn 2673-4117
language English
last_indexed 2024-03-11T06:36:25Z
publishDate 2023-02-01
publisher MDPI AG
record_format Article
series Eng
spelling doaj.art-f461a82f13794e8e8191d4c4fbb4fb5e2023-11-17T10:53:27ZengMDPI AGEng2673-41172023-02-014155556810.3390/eng4010033Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine LearningYubiry Gonzalez0Ronaldo C. Prati1Center of Mathematics, Computer Science, and Cognition, Federal University of ABC, Av. Dos Estados, 5001, Santo André 09210-580, SP, BrazilCenter of Mathematics, Computer Science, and Cognition, Federal University of ABC, Av. Dos Estados, 5001, Santo André 09210-580, SP, BrazilMusical timbre is a phenomenon of auditory perception that allows the recognition of musical sounds. The recognition of musical timbre is a challenging task because the timbre of a musical instrument or sound source is a complex and multifaceted phenomenon that is influenced by a variety of factors, including the physical properties of the instrument or sound source, the way it is played or produced, and the recording and processing techniques used. In this paper, we explore an abstract space with 7 dimensions formed by the fundamental frequency and FFT-Acoustic Descriptors in 240 monophonic sounds from the Tinysol and Good-Sounds databases, corresponding to the fourth octave of the transverse flute and clarinet. This approach allows us to unequivocally define a collection of points and, therefore, a timbral space (Category Theory) that allows different sounds of any type of musical instrument with its respective dynamics to be represented as a single characteristic vector. The geometric distance would allow studying the timbral similarity between audios of different sounds and instruments or between different musical dynamics and datasets. Additionally, a Machine-Learning algorithm that evaluates timbral similarities through Euclidean distances in the abstract space of 7 dimensions was proposed. We conclude that the study of timbral similarity through geometric distances allowed us to distinguish between audio categories of different sounds and musical instruments, between the same type of sound and an instrument with different relative dynamics, and between different datasets.https://www.mdpi.com/2673-4117/4/1/33musical timbreFFTmusical instrumentsacoustic descriptorsmachine learningdata analysis
spellingShingle Yubiry Gonzalez
Ronaldo C. Prati
Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine Learning
Eng
musical timbre
FFT
musical instruments
acoustic descriptors
machine learning
data analysis
title Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine Learning
title_full Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine Learning
title_fullStr Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine Learning
title_full_unstemmed Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine Learning
title_short Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine Learning
title_sort similarity of musical timbres using fft acoustic descriptor analysis and machine learning
topic musical timbre
FFT
musical instruments
acoustic descriptors
machine learning
data analysis
url https://www.mdpi.com/2673-4117/4/1/33
work_keys_str_mv AT yubirygonzalez similarityofmusicaltimbresusingfftacousticdescriptoranalysisandmachinelearning
AT ronaldocprati similarityofmusicaltimbresusingfftacousticdescriptoranalysisandmachinelearning