The Rest is Silence: Leveraging Unseen Species Models for Computational Musicology

📅 2025-07-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Music scholarship frequently confronts incomplete historical data—e.g., missing composers in the RISM database, low catalog coverage of Gregorian chant, unquantified variant readings across musical editions, underrepresentation of folk repertoires, and uncertainty regarding the scale of harmonic vocabularies. This paper introduces, for the first time in musicology, unseen species models (USMs) from ecology, integrating frequency distribution modeling with interpolation techniques to establish a novel statistical inference paradigm for incomplete music文献. Four empirical case studies systematically estimate: (1) the number of undocumented composers; (2) medieval chant catalog coverage; (3) expected numbers of variant readings between editions; and (4) the aggregate size of harmonic vocabularies across multiple composers. Moving beyond descriptive statistics, this approach enables verifiable, reproducible quantification of “silent” or unobserved portions of the musical record, offering both a theoretical framework and practical methodology for diagnosing data completeness and enriching metadata in historical music research.

Technology Category

Application Category

📝 Abstract
For many decades, musicologists have engaged in creating large databases serving different purposes for musicological research and scholarship. With the rise of fields like music information retrieval and digital musicology, there is now a constant and growing influx of musicologically relevant datasets and corpora. In historical or observational settings, however, these datasets are necessarily incomplete, and the true extent of a collection of interest remains unknown -- silent. Here, we apply, for the first time, so-called Unseen Species models (USMs) from ecology to areas of musicological activity. After introducing the models formally, we show in four case studies how USMs can be applied to musicological data to address quantitative questions like: How many composers are we missing in RISM? What percentage of medieval sources of Gregorian chant have we already cataloged? How many differences in music prints do we expect to find between editions? How large is the coverage of songs from genres of a folk music tradition? And, finally, how close are we in estimating the size of the harmonic vocabulary of a large number of composers?
Problem

Research questions and friction points this paper is trying to address.

Estimating missing composers in historical music databases
Assessing catalog completeness of medieval chant sources
Measuring coverage of folk music tradition genres
Innovation

Methods, ideas, or system contributions that make the work stand out.

Applying Unseen Species models to musicology
Estimating missing composers in music databases
Quantifying coverage of historical music sources
🔎 Similar Papers
No similar papers found.