Explicit Tonal Tension Conditioning via Dual-Level Beam Search for Symbolic Music Generation

📅 2025-11-24

📈 Citations: 0

✨ Influential: 0

career value

247K/year

🤖 AI Summary

This study addresses the challenge of explicit tonal tension control in symbolic music generation. We propose a tension-driven, two-level beam search framework that integrates an interval-vector-based tension computation model with a Transformer architecture. Joint optimization is performed at both the token level (to preserve diversity) and the bar level (to align generated outputs with a target tension curve), augmented by a tension-guided re-ranking mechanism. To our knowledge, this is the first approach enabling intuitive, precise, and interpretable control of tonal tension during AI composition—while supporting diverse stylistic realizations under identical tension contours. Objective evaluation demonstrates significantly improved tension regulation accuracy; subjective listening tests confirm that generated pieces exhibit both consistent tension evolution and high musical quality. Our work establishes a novel paradigm for controllable music generation.

Technology Category

Application Category

📝 Abstract

State-of-the-art symbolic music generation models have recently achieved remarkable output quality, yet explicit control over compositional features, such as tonal tension, remains challenging. We propose a novel approach that integrates a computational tonal tension model, based on tonal interval vector analysis, into a Transformer framework. Our method employs a two-level beam search strategy during inference. At the token level, generated candidates are re-ranked using model probability and diversity metrics to maintain overall quality. At the bar level, a tension-based re-ranking is applied to ensure that the generated music aligns with a desired tension curve. Objective evaluations indicate that our approach effectively modulates tonal tension, and subjective listening tests confirm that the system produces outputs that align with the target tension. These results demonstrate that explicit tension conditioning through a dual-level beam search provides a powerful and intuitive tool to guide AI-generated music. Furthermore, our experiments demonstrate that our method can generate multiple distinct musical interpretations under the same tension condition.

Problem

Research questions and friction points this paper is trying to address.

Explicit control over tonal tension in symbolic music generation

Integrating computational tension model with Transformer framework

Ensuring generated music aligns with desired tension curves

Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer framework with tonal tension model

Two-level beam search for music generation

Tension-based re-ranking aligns with target curve

🔎 Similar Papers

MusicMamba: A Dual-Feature Modeling Approach for Generating Chinese Traditional Music with Modal Precision