๐ค AI Summary
This study addresses the inefficiency and non-computability of manual Schenkerian analysis in hierarchical music analysis of Baroque fugue subjects. We propose the first end-to-end, automated hierarchical analysis framework for multi-relational music graphs. Methodologically, we introduce a novel differentiable graph pooling mechanism based on node isolation, integrated with multi-relational graph representation learning and a hierarchical graph neural network, enabling joint learning from note-level inputs to structural-level representations. Evaluated on a fugue subject dataset, our model achieves expert-level analytical accuracy while substantially improving efficiency, consistency, and reproducibility. The learned hierarchical representations are semantically grounded and computationally tractable, providing an interpretable and scalable structural foundation for downstream tasksโincluding music understanding, generation, and pedagogy.
๐ Abstract
Hierarchical representations provide powerful and principled approaches for analyzing many musical genres. Such representations have been broadly studied in music theory, for instance via Schenkerian analysis (SchA). Hierarchical music analyses, however, are highly cost-intensive; the analysis of a single piece of music requires a great deal of time and effort from trained experts. The representation of hierarchical analyses in a computer-readable format is a further challenge. Given recent developments in hierarchical deep learning and increasing quantities of computer-readable data, there is great promise in extending such work for an automatic hierarchical representation framework. This paper thus introduces a novel approach, AutoSchA, which extends recent developments in graph neural networks (GNNs) for hierarchical music analysis. AutoSchA features three key contributions: 1) a new graph learning framework for hierarchical music representation, 2) a new graph pooling mechanism based on node isolation that directly optimizes learned pooling assignments, and 3) a state-of-the-art architecture that integrates such developments for automatic hierarchical music analysis. We show, in a suite of experiments, that AutoSchA performs comparably to human experts when analyzing Baroque fugue subjects.