🤖 AI Summary
This work addresses the challenge of handling fuzzy predicates—such as “sufficiently stable”—in natural language planning, whose graded semantics are incompatible with existing category-theoretic planners that rely solely on hard constraints. To bridge this gap, the paper introduces Fuzzy Categorical Planning (FCP), the first framework integrating fuzzy logic with categorical planning. FCP models action applicability via degrees in [0,1], composes plan quality using Łukasiewicz t-norms, and preserves pullback-based executability checks. It further incorporates a residual operator to enable bidirectional search and leverages large language models (LLMs) combined with k-sample median aggregation for robust mapping from natural language descriptions to graded applicability scores. Evaluated on RecipeNLG-Subs and PDDL3 benchmarks, FCP significantly reduces hard constraint violations and improves success rates over pure LLM and ReAct baselines.
📝 Abstract
Natural-language planning often involves vague predicates (e.g., suitable substitute, stable enough) whose satisfaction is inherently graded. Existing category-theoretic planners provide compositional structure and pullback-based hard-constraint verification, but treat applicability as crisp, forcing thresholding that collapses meaningful distinctions and cannot track quality degradation across multi-step plans. We propose Fuzzy Category-theoretic Planning (FCP), which annotates each action (morphism) with a degree in [0,1], composes plan quality via a t-norm Lukasiewicz, and retains crisp executability checks via pullback verification. FCP grounds graded applicability from language using an LLM with k-sample median aggregation and supports meeting-in-the-middle search using residuum-based backward requirements. We evaluate on (i) public PDDL3 preference/oversubscription benchmarks and (ii) RecipeNLG-Subs, a missing-substitute recipe-planning benchmark built from RecipeNLG with substitution candidates from Recipe1MSubs and FoodKG. FCP improves success and reduces hard-constraint violations on RecipeNLG-Subs compared to LLM-only and ReAct-style baselines, while remaining competitive with classical PDDL3 planners.