Heaps’ law and vocabulary richness in the history of classical music harmony

Abstract Music is a fundamental human construct, and harmony provides the building blocks of musical language. Using the Kunstderfuge corpus of classical music, we analyze the historical evolution of the richness of harmonic vocabulary of 76 classical composers, covering almost 6 centuries. Such cor...

Full description

Bibliographic Details
Main Authors: Marc Serra-Peralta, Joan Serrà, Álvaro Corral
Format: Article
Language:English
Published: SpringerOpen 2021-08-01
Series:EPJ Data Science
Subjects:
Online Access:https://doi.org/10.1140/epjds/s13688-021-00293-8
_version_ 1818435268452024320
author Marc Serra-Peralta
Joan Serrà
Álvaro Corral
author_facet Marc Serra-Peralta
Joan Serrà
Álvaro Corral
author_sort Marc Serra-Peralta
collection DOAJ
description Abstract Music is a fundamental human construct, and harmony provides the building blocks of musical language. Using the Kunstderfuge corpus of classical music, we analyze the historical evolution of the richness of harmonic vocabulary of 76 classical composers, covering almost 6 centuries. Such corpus comprises about 9500 pieces, resulting in more than 5 million tokens of music codewords. The fulfilment of Heaps’ law for the relation between the size of the harmonic vocabulary of a composer (in codeword types) and the total length of his works (in codeword tokens), with an exponent around 0.35, allows us to define a relative measure of vocabulary richness that has a transparent interpretation. When coupled with the considered corpus, this measure allows us to quantify harmony richness across centuries, unveiling a clear increasing linear trend. In this way, we are able to rank the composers in terms of richness of vocabulary, in the same way as for other related metrics, such as entropy. We find that the latter is particularly highly correlated with our measure of richness. Our approach is not specific for music and can be applied to other systems built by tokens of different types, as for instance natural language.
first_indexed 2024-12-14T16:50:11Z
format Article
id doaj.art-58297e8c4a254ba2b663361106ab7a40
institution Directory Open Access Journal
issn 2193-1127
language English
last_indexed 2024-12-14T16:50:11Z
publishDate 2021-08-01
publisher SpringerOpen
record_format Article
series EPJ Data Science
spelling doaj.art-58297e8c4a254ba2b663361106ab7a402022-12-21T22:54:04ZengSpringerOpenEPJ Data Science2193-11272021-08-0110111710.1140/epjds/s13688-021-00293-8Heaps’ law and vocabulary richness in the history of classical music harmonyMarc Serra-Peralta0Joan Serrà1Álvaro Corral2Centre de Recerca MatemàticaDolby LaboratoriesCentre de Recerca MatemàticaAbstract Music is a fundamental human construct, and harmony provides the building blocks of musical language. Using the Kunstderfuge corpus of classical music, we analyze the historical evolution of the richness of harmonic vocabulary of 76 classical composers, covering almost 6 centuries. Such corpus comprises about 9500 pieces, resulting in more than 5 million tokens of music codewords. The fulfilment of Heaps’ law for the relation between the size of the harmonic vocabulary of a composer (in codeword types) and the total length of his works (in codeword tokens), with an exponent around 0.35, allows us to define a relative measure of vocabulary richness that has a transparent interpretation. When coupled with the considered corpus, this measure allows us to quantify harmony richness across centuries, unveiling a clear increasing linear trend. In this way, we are able to rank the composers in terms of richness of vocabulary, in the same way as for other related metrics, such as entropy. We find that the latter is particularly highly correlated with our measure of richness. Our approach is not specific for music and can be applied to other systems built by tokens of different types, as for instance natural language.https://doi.org/10.1140/epjds/s13688-021-00293-8Heaps’ lawEntropyMIDI scoresHarmonic richnessCulturomics
spellingShingle Marc Serra-Peralta
Joan Serrà
Álvaro Corral
Heaps’ law and vocabulary richness in the history of classical music harmony
EPJ Data Science
Heaps’ law
Entropy
MIDI scores
Harmonic richness
Culturomics
title Heaps’ law and vocabulary richness in the history of classical music harmony
title_full Heaps’ law and vocabulary richness in the history of classical music harmony
title_fullStr Heaps’ law and vocabulary richness in the history of classical music harmony
title_full_unstemmed Heaps’ law and vocabulary richness in the history of classical music harmony
title_short Heaps’ law and vocabulary richness in the history of classical music harmony
title_sort heaps law and vocabulary richness in the history of classical music harmony
topic Heaps’ law
Entropy
MIDI scores
Harmonic richness
Culturomics
url https://doi.org/10.1140/epjds/s13688-021-00293-8
work_keys_str_mv AT marcserraperalta heapslawandvocabularyrichnessinthehistoryofclassicalmusicharmony
AT joanserra heapslawandvocabularyrichnessinthehistoryofclassicalmusicharmony
AT alvarocorral heapslawandvocabularyrichnessinthehistoryofclassicalmusicharmony