Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning

📅 2025-04-07

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Problem: In automated algorithm discovery, large language models (LLMs) are typically used as static generators, with no mechanism for updating the model from the feedback that evolutionary search produces. Method: The paper augments LLM-based evolutionary search with reinforcement learning (RL) fine-tuning, so that the LLM acts as a search operator that is itself refined during the search. Evolutionary search supplies the exploration, and RL updates the LLM policy based on the improved algorithms that the search discovers for combinatorial optimization tasks (bin packing, traveling salesman, and flatpack). Contribution/Results: Rather than relying on prompt engineering around a fixed model, the approach optimizes the model together with the search. Experiments on the three tasks show that combining RL with evolutionary search improves the efficiency of discovering improved algorithms.

📝 Abstract

Discovering efficient algorithms for solving complex problems has been an outstanding challenge in mathematics and computer science, requiring substantial human expertise over the years. Recent advancements in evolutionary search with large language models (LLMs) have shown promise in accelerating the discovery of algorithms across various domains, particularly in mathematics and optimization. However, existing approaches treat the LLM as a static generator, missing the opportunity to update the model with the signal obtained from evolutionary exploration. In this work, we propose to augment LLM-based evolutionary search by continuously refining the search operator (the LLM) through reinforcement learning (RL) fine-tuning. Our method leverages evolutionary search as an exploration strategy to discover improved algorithms, while RL optimizes the LLM policy based on these discoveries. Our experiments on three combinatorial optimization tasks (bin packing, traveling salesman, and the flatpack problem) show that combining RL and evolutionary search improves the efficiency of discovering improved algorithms, showcasing the potential of RL-enhanced evolutionary strategies to help computer scientists and mathematicians design algorithms more efficiently.
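The interplay the abstract describes (evolutionary search explores, RL updates the proposer) can be illustrated with a deliberately tiny sketch. Everything here is an assumption made for illustration and not the paper's implementation: candidates are plain floats instead of programs, `ToyPolicy` is a hypothetical Gaussian proposer standing in for the LLM, `evaluate` is a made-up fitness function, and the update is a simple REINFORCE-style step rather than whatever fine-tuning objective the authors use.

```python
import random
from typing import List, Tuple


def evaluate(candidate: float) -> float:
    """Toy solution quality (higher is better); the optimum sits at 3.0."""
    return -(candidate - 3.0) ** 2


class ToyPolicy:
    """Hypothetical stand-in for the LLM: proposes an edit `delta` to a parent."""

    def __init__(self, sigma: float = 1.0, lr: float = 0.05) -> None:
        self.bias = 0.0      # learnable mean of the proposed edit
        self.sigma = sigma   # fixed exploration noise
        self.lr = lr         # step size of the policy update

    def sample_delta(self, rng: random.Random) -> float:
        return rng.gauss(self.bias, self.sigma)

    def update(self, batch: List[Tuple[float, float]]) -> None:
        """REINFORCE-style step on the mean, from (delta, reward) pairs."""
        baseline = sum(r for _, r in batch) / len(batch)
        grad = sum((r - baseline) * (d - self.bias) for d, r in batch)
        grad /= len(batch) * self.sigma ** 2
        # Clip so the toy policy stays numerically tame.
        self.bias = max(-2.0, min(2.0, self.bias + self.lr * grad))


def run_search(generations: int = 30, proposals: int = 16,
               keep: int = 20, seed: int = 0) -> Tuple[float, List[float]]:
    rng = random.Random(seed)
    policy = ToyPolicy()
    population = [0.0]            # candidate database, seeded with a baseline
    history: List[float] = []     # best score after each generation
    for _ in range(generations):
        batch = []
        for _ in range(proposals):
            # Evolutionary step: tournament selection, then a policy proposal.
            contenders = rng.sample(population, min(3, len(population)))
            parent = max(contenders, key=evaluate)
            delta = policy.sample_delta(rng)
            child = parent + delta
            # Reward: improvement of the child over its parent.
            batch.append((delta, evaluate(child) - evaluate(parent)))
            population.append(child)
        # Survival: keep only the best candidates (elitism).
        population = sorted(population, key=evaluate, reverse=True)[:keep]
        # RL step: refine the proposer with the signal from this generation.
        policy.update(batch)
        history.append(evaluate(population[0]))
    return population[0], history
```

Calling `run_search()` returns the best candidate found and the per-generation best score. Because survivors are chosen by elitism, that score never decreases; the policy update only changes how quickly good proposals appear, which is the "discovery efficiency" the abstract refers to.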

Problem

Research questions and friction points this paper is trying to address.

Enhancing algorithm discovery via LLM-based evolutionary search and RL fine-tuning

Improving efficiency in solving combinatorial optimization problems like bin packing

Updating LLMs dynamically using evolutionary exploration signals for better algorithms
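For the bin packing task named above, one way to turn a candidate algorithm into a scalar signal is to run it on problem instances and count the bins it uses. The sketch below assumes an online setting in which the candidate is a priority function over feasible bins; that framing, and the names `pack_online`, `reward`, and `best_fit`, are illustrative assumptions rather than details confirmed by this page.

```python
from typing import Callable, List

# A candidate heuristic: (item size, remaining capacity of a bin) -> score.
Priority = Callable[[int, int], float]


def pack_online(items: List[int], capacity: int, priority: Priority) -> List[int]:
    """Place items one by one; return the remaining capacity of each bin."""
    remaining: List[int] = []
    for item in items:
        if item > capacity:
            raise ValueError(f"item {item} exceeds bin capacity {capacity}")
        # Bins that can still hold this item.
        feasible = [i for i, r in enumerate(remaining) if r >= item]
        if feasible:
            # Use the feasible bin that the candidate heuristic ranks highest.
            target = max(feasible, key=lambda i: priority(item, remaining[i]))
            remaining[target] -= item
        else:
            # No bin fits: open a new one.
            remaining.append(capacity - item)
    return remaining


def reward(items: List[int], capacity: int, priority: Priority) -> int:
    """Fewer bins is better, so the reward is the negated bin count."""
    return -len(pack_online(items, capacity, priority))


def best_fit(item: int, rem: int) -> float:
    """Classic best-fit: prefer the bin left with the least slack."""
    return -(rem - item)
```

With `items = [4, 8, 1, 4, 2, 1]` and `capacity = 10`, `best_fit` fills two bins exactly, so `reward` returns `-2`. A search procedure would compare such rewards across candidate priority functions and across many instances.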

Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-based evolutionary search with RL fine-tuning

Continuous refinement of LLM via reinforcement learning

Enhanced algorithm discovery efficiency in optimization tasks

🔎 Similar Papers

EvoPrompt: Connecting LLMs with Evolutionary Algorithms Yields Powerful Prompt Optimizers