🤖 AI Summary
This paper addresses the challenge of balancing formula diversity and accuracy in symbolic regression by proposing the first diffusion-based equation generation framework. Methodologically, it introduces a stochastic masked diffusion-denoising mechanism for structured symbolic expression generation and synergistically combines token-level Grouped Relative Policy Optimization (GRPO) with a long-term risk-seeking reinforcement learning strategy to search for optimal solutions. Key contributions include: (1) the first application of diffusion models to symbolic regression, breaking away from conventional tree- or sequence-based generation paradigms; (2) the novel integration of GRPO and risk-seeking exploration, substantially enhancing both formula diversity and generalization capability; and (3) consistent state-of-the-art performance across multiple benchmark datasets, with ablation studies confirming the effectiveness of each component.
📝 Abstract
Diffusion has emerged as a powerful framework for generative modeling, achieving remarkable success in applications such as image and audio synthesis. Enlightened by this progress, we propose a novel diffusion-based approach for symbolic regression. We construct a random mask-based diffusion and denoising process to generate diverse and high-quality equations. We integrate this generative processes with a token-wise Group Relative Policy Optimization (GRPO) method to conduct efficient reinforcement learning on the given measurement dataset. In addition, we introduce a long short-term risk-seeking policy to expand the pool of top-performing candidates, further enhancing performance. Extensive experiments and ablation studies have demonstrated the effectiveness of our approach.