Insertion Based Sequence Generation with Learnable Order Dynamics

πŸ“… 2026-02-20
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenge of training insertion-based sequence generation models, which suffer from an intractably large action space. The authors propose a novel approach that integrates a learnable sequential dynamics mechanism into the target rate of discrete flow matching, enabling joint optimization of the generator and the insertion order policy without requiring numerical simulation, thereby facilitating end-to-end training. The framework employs a variable-length masked diffusion model as the generator, achieving for the first time flexible and stable order modeling in insertion-based generation. Experiments on de novo small molecule generation demonstrate significant improvements over uniform ordering strategies in both the quantity of valid molecules and overall generation quality. Further graph traversal studies elucidate the performance trade-offs among different parameterized order policies.

Technology Category

Application Category

πŸ“ Abstract
In many domains generating variable length sequences through insertions provides greater flexibility over autoregressive models. However, the action space of insertion models is much larger than that of autoregressive models (ARMs) making the learning challenging. To address this, we incorporate trainable order dynamics into the target rates for discrete flow matching, and show that with suitable choices of parameterizations, joint training of the target order dynamics and the generator is tractable without the need for numerical simulation. As the generative insertion model, we use a variable length masked diffusion model, which generates by inserting and filling mask tokens. On graph traversal tasks for which a locally optimal insertion order is known, we explore the choices of parameterization empirically and demonstrate the trade-offs between flexibility, training stability and generation quality. On de novo small molecule generation, we find that the learned order dynamics leads to an increase in the number of valid molecules generated and improved quality, when compared to uniform order dynamics.
Problem

Research questions and friction points this paper is trying to address.

insertion-based sequence generation
action space
learnable order dynamics
discrete flow matching
variable length sequences
Innovation

Methods, ideas, or system contributions that make the work stand out.

insertion-based generation
learnable order dynamics
discrete flow matching
masked diffusion model
variable-length sequence generation
πŸ”Ž Similar Papers
No similar papers found.