🤖 AI Summary
This paper addresses the problem of unknown action costs in classical planning, formally introducing the novel task of *learning action costs from input plans*: given a set of unlabeled optimal (or k-optimal) plans, infer action costs such that all input plans are optimal (or k-optimal) under the learned cost model. To solve this, we propose LACFIP<sup>k</sup>, an algorithm integrating integer linear programming modeling, feasibility-driven iterative optimization, cost-space pruning, and explicit k-optimality constraints. We provide a theoretical proof of its finite-step convergence. Experiments across multiple planning domains demonstrate that LACFIP<sup>k</sup> perfectly recovers the correct action cost ranking (100% accuracy), significantly outperforming existing baselines. Our work establishes a new paradigm for inverse planning and enhances model interpretability by enabling cost inference directly from observed optimal behavior.
📝 Abstract
Most of the work on learning action models focus on learning the actions' dynamics from input plans. This allows us to specify the valid plans of a planning task. However, very little work focuses on learning action costs, which in turn allows us to rank the different plans. In this paper we introduce a new problem: that of learning the costs of a set of actions such that a set of input plans are optimal under the resulting planning model. To solve this problem we present $LACFIP^k$, an algorithm to learn action's costs from unlabeled input plans. We provide theoretical and empirical results showing how $LACFIP^k$ can successfully solve this task.