Heaps’ law and vocabulary richness in the history of classical music harmony
Abstract Music is a fundamental human construct, and harmony provides the building blocks of musical language. Using the Kunstderfuge corpus of classical music, we analyze the historical evolution of the richness of harmonic vocabulary of 76 classical composers, covering almost 6 centuries. Such cor...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2021-08-01
|
Series: | EPJ Data Science |
Subjects: | |
Online Access: | https://doi.org/10.1140/epjds/s13688-021-00293-8 |
_version_ | 1818435268452024320 |
---|---|
author | Marc Serra-Peralta Joan Serrà Álvaro Corral |
author_facet | Marc Serra-Peralta Joan Serrà Álvaro Corral |
author_sort | Marc Serra-Peralta |
collection | DOAJ |
description | Abstract Music is a fundamental human construct, and harmony provides the building blocks of musical language. Using the Kunstderfuge corpus of classical music, we analyze the historical evolution of the richness of harmonic vocabulary of 76 classical composers, covering almost 6 centuries. Such corpus comprises about 9500 pieces, resulting in more than 5 million tokens of music codewords. The fulfilment of Heaps’ law for the relation between the size of the harmonic vocabulary of a composer (in codeword types) and the total length of his works (in codeword tokens), with an exponent around 0.35, allows us to define a relative measure of vocabulary richness that has a transparent interpretation. When coupled with the considered corpus, this measure allows us to quantify harmony richness across centuries, unveiling a clear increasing linear trend. In this way, we are able to rank the composers in terms of richness of vocabulary, in the same way as for other related metrics, such as entropy. We find that the latter is particularly highly correlated with our measure of richness. Our approach is not specific for music and can be applied to other systems built by tokens of different types, as for instance natural language. |
first_indexed | 2024-12-14T16:50:11Z |
format | Article |
id | doaj.art-58297e8c4a254ba2b663361106ab7a40 |
institution | Directory Open Access Journal |
issn | 2193-1127 |
language | English |
last_indexed | 2024-12-14T16:50:11Z |
publishDate | 2021-08-01 |
publisher | SpringerOpen |
record_format | Article |
series | EPJ Data Science |
spelling | doaj.art-58297e8c4a254ba2b663361106ab7a402022-12-21T22:54:04ZengSpringerOpenEPJ Data Science2193-11272021-08-0110111710.1140/epjds/s13688-021-00293-8Heaps’ law and vocabulary richness in the history of classical music harmonyMarc Serra-Peralta0Joan Serrà1Álvaro Corral2Centre de Recerca MatemàticaDolby LaboratoriesCentre de Recerca MatemàticaAbstract Music is a fundamental human construct, and harmony provides the building blocks of musical language. Using the Kunstderfuge corpus of classical music, we analyze the historical evolution of the richness of harmonic vocabulary of 76 classical composers, covering almost 6 centuries. Such corpus comprises about 9500 pieces, resulting in more than 5 million tokens of music codewords. The fulfilment of Heaps’ law for the relation between the size of the harmonic vocabulary of a composer (in codeword types) and the total length of his works (in codeword tokens), with an exponent around 0.35, allows us to define a relative measure of vocabulary richness that has a transparent interpretation. When coupled with the considered corpus, this measure allows us to quantify harmony richness across centuries, unveiling a clear increasing linear trend. In this way, we are able to rank the composers in terms of richness of vocabulary, in the same way as for other related metrics, such as entropy. We find that the latter is particularly highly correlated with our measure of richness. Our approach is not specific for music and can be applied to other systems built by tokens of different types, as for instance natural language.https://doi.org/10.1140/epjds/s13688-021-00293-8Heaps’ lawEntropyMIDI scoresHarmonic richnessCulturomics |
spellingShingle | Marc Serra-Peralta Joan Serrà Álvaro Corral Heaps’ law and vocabulary richness in the history of classical music harmony EPJ Data Science Heaps’ law Entropy MIDI scores Harmonic richness Culturomics |
title | Heaps’ law and vocabulary richness in the history of classical music harmony |
title_full | Heaps’ law and vocabulary richness in the history of classical music harmony |
title_fullStr | Heaps’ law and vocabulary richness in the history of classical music harmony |
title_full_unstemmed | Heaps’ law and vocabulary richness in the history of classical music harmony |
title_short | Heaps’ law and vocabulary richness in the history of classical music harmony |
title_sort | heaps law and vocabulary richness in the history of classical music harmony |
topic | Heaps’ law Entropy MIDI scores Harmonic richness Culturomics |
url | https://doi.org/10.1140/epjds/s13688-021-00293-8 |
work_keys_str_mv | AT marcserraperalta heapslawandvocabularyrichnessinthehistoryofclassicalmusicharmony AT joanserra heapslawandvocabularyrichnessinthehistoryofclassicalmusicharmony AT alvarocorral heapslawandvocabularyrichnessinthehistoryofclassicalmusicharmony |