On Learning Action Costs from Input Plans

📅 2024-08-20
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the problem of unknown action costs in classical planning, formally introducing the novel task of *learning action costs from input plans*: given a set of unlabeled optimal (or k-optimal) plans, infer action costs such that all input plans are optimal (or k-optimal) under the learned cost model. To solve this, we propose LACFIP<sup>k</sup>, an algorithm integrating integer linear programming modeling, feasibility-driven iterative optimization, cost-space pruning, and explicit k-optimality constraints. We provide a theoretical proof of its finite-step convergence. Experiments across multiple planning domains demonstrate that LACFIP<sup>k</sup> perfectly recovers the correct action cost ranking (100% accuracy), significantly outperforming existing baselines. Our work establishes a new paradigm for inverse planning and enhances model interpretability by enabling cost inference directly from observed optimal behavior.

Technology Category

Application Category

📝 Abstract
Most of the work on learning action models focus on learning the actions' dynamics from input plans. This allows us to specify the valid plans of a planning task. However, very little work focuses on learning action costs, which in turn allows us to rank the different plans. In this paper we introduce a new problem: that of learning the costs of a set of actions such that a set of input plans are optimal under the resulting planning model. To solve this problem we present $LACFIP^k$, an algorithm to learn action's costs from unlabeled input plans. We provide theoretical and empirical results showing how $LACFIP^k$ can successfully solve this task.
Problem

Research questions and friction points this paper is trying to address.

Learning action costs from unlabeled input plans
Ensuring input plans become optimal solutions
Ranking different plans through cost optimization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Learning action costs from unlabeled plans
Algorithm LACFIP^k solves cost optimization
Ensures input plans become optimal solutions
🔎 Similar Papers
No similar papers found.