GenePlan: Evolving Better Generalized PDDL Plans using Large Language Models

📅 2026-03-10

📈 Citations: 0

✨ Influential: 0

career value

193K/year

🤖 AI Summary

This work proposes a novel approach to automatically synthesize efficient and generalizable domain-specific planners for classical PDDL planning tasks. By formulating generalized planning as an optimization problem, the method integrates large language models (e.g., GPT-4o) with evolutionary algorithms to iteratively evolve interpretable Python programs that serve as planners. Evaluated on eight standard domains, the resulting planners achieve an average SAT score of 0.91—approaching the performance of state-of-the-art planners (0.93) and substantially outperforming LLM-based baselines such as chain-of-thought prompting (0.64). Moreover, the synthesized planners solve new instances in just 0.49 seconds on average, with a generation cost of approximately $1.82 per domain, demonstrating superior trade-offs among performance, speed, and cost compared to existing LLM-driven methods.

Technology Category

Application Category

📝 Abstract

We present GenePlan (GENeralized Evolutionary Planner), a novel framework that leverages large language model (LLM) assisted evolutionary algorithms to generate domain-dependent generalized planners for classical planning tasks described in PDDL. By casting generalized planning as an optimization problem, GenePlan iteratively evolves interpretable Python planners that minimize plan length across diverse problem instances. In empirical evaluation across six existing benchmark domains and two new domains, GenePlan achieved an average SAT score of 0.91, closely matching the performance of the state-of-the-art planners (SAT score 0.93), and significantly outperforming other LLM-based baselines such as chain-of-thought (CoT) prompting (average SAT score 0.64). The generated planners solve new instances rapidly (average 0.49 seconds per task) and at low cost (average $1.82 per domain using GPT-4o).

Problem

Research questions and friction points this paper is trying to address.

generalized planning

PDDL

domain-dependent planner

plan generalization

classical planning

Innovation

Methods, ideas, or system contributions that make the work stand out.

Generalized Planning

Large Language Models

Evolutionary Algorithms