Segment-Factorized Full-Song Generation on Symbolic Piano Music

📅 2025-10-07
🤖 AI Summary
This study addresses the challenge of combining structural controllability with musical coherence in symbolic full-song piano generation. It proposes the Segmented Full-Song Model (SFS), which factorizes a song into segments following a user-provided structure (e.g., verse, chorus) and generates each segment by attending selectively to related segments. This factorization is intended to preserve long-term structural consistency while keeping each segment locally fluent. The model can also take an optional short seed segment that anchors the main musical idea. SFS is wrapped in a web application with a piano roll interface that supports seed-guided generation, customizable structures, flexible generation order, and iterative editing. The authors report higher quality and efficiency than prior work on full-song generation, and present the web application as a tool for human-AI co-creation.

📝 Abstract
We propose the Segmented Full-Song Model (SFS) for symbolic full-song generation. The model accepts a user-provided song structure and an optional short seed segment that anchors the main idea around which the song is developed. By factorizing a song into segments and generating each one through selective attention to related segments, the model achieves higher quality and efficiency compared to prior work. To demonstrate its suitability for human-AI interaction, we further wrap SFS into a web application that enables users to iteratively co-create music on a piano roll with customizable structures and flexible ordering.

Problem

Research questions and friction points this paper is trying to address.

Generating symbolic full-song piano music
Factorizing songs into segments for efficient generation
Enabling human-AI co-creation with customizable structures

Innovation

Methods, ideas, or system contributions that make the work stand out.

Segments generated via selective attention mechanisms
User-defined song structure with optional seed segments
Web app enables interactive co-creation on piano roll
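
The idea of generating one segment by attending only to related segments can be illustrated with a minimal sketch. This is not the authors' implementation: the `Segment` class, the `select_context` function, and the selection rule (the seed, already generated segments with the same label, and already generated neighbors) are hypothetical choices made for illustration, since the summary does not specify how "related" is defined.

```python
from dataclasses import dataclass
from typing import List, Optional, Set, Tuple


@dataclass
class Segment:
    index: int   # position of the segment in the song
    label: str   # user-chosen section label, e.g. "A" or "B"
    bars: int    # segment length in bars


def make_structure(spec: List[Tuple[str, int]]) -> List[Segment]:
    """Turn a user-provided list of (label, bars) into indexed segments."""
    return [Segment(i, label, bars) for i, (label, bars) in enumerate(spec)]


def select_context(
    structure: List[Segment],
    target: int,
    generated: Set[int],
    seed: Optional[int] = None,
) -> List[int]:
    """Pick which existing segments the target segment may attend to.

    Assumed rule (illustrative only): a segment is "related" if it has
    already been generated and is the seed, shares the target's label,
    or is directly adjacent to the target.
    """
    context = []
    for seg in structure:
        if seg.index == target or seg.index not in generated:
            continue  # skip the target itself and segments not yet written
        is_seed = seed is not None and seg.index == seed
        same_label = seg.label == structure[target].label
        adjacent = abs(seg.index - target) == 1
        if is_seed or same_label or adjacent:
            context.append(seg.index)
    return context


# Hypothetical A-B-A-B song, eight bars per segment.
song = make_structure([("A", 8), ("B", 8), ("A", 8), ("B", 8)])

# Segments can be filled in any order; here segment 3 is generated last.
print(select_context(song, target=3, generated={0, 1, 2}, seed=0))
```

Because the context is computed from whatever has been generated so far, the same function works for any generation order, which is one plausible way to support the flexible ordering the web application offers. In a real model the selected indices would determine which token ranges are visible in the attention mask.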
Ping-Yi Chen
National Cheng Kung University
Chih-Pin Tan
National Taiwan University
Yi-Hsuan Yang
National Taiwan University
Music information retrieval, Music Generation, Music Processing, Music AI, Affective computing