In Search of Trees: Decision-Tree Policy Synthesis for Black-Box Systems via Search

📅 2024-09-05

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

career value

230K/year

🤖 AI Summary

Synthesizing optimal, interpretable controllers for black-box dynamical systems—without access to analytical models or expert demonstrations—remains a fundamental challenge. Method: This paper proposes a model- and policy-agnostic decision-tree controller synthesis method. Given input-output trajectory data, a discrete predicate set, and an initial state set, it employs a trajectory-driven pruning search algorithm to construct decision trees that are provably step-optimal and structurally minimal in the black-box setting. Key technical components include predicate-space discretization, combinatorial trajectory evaluation, domain-specific pruning rules, and black-box interaction-based verification. Results: Evaluated on multiple control benchmarks, the synthesized policies are compact, formally verifiable, and achieve 10–100× speedup in synthesis time over MILP-based and reinforcement learning baselines. The approach overcomes longstanding limitations of white-box modeling assumptions and imitation learning dependencies, enabling scalable, certifiable control synthesis for unknown dynamics.

Technology Category

Application Category

📝 Abstract

Decision trees, owing to their interpretability, are attractive as control policies for (dynamical) systems. Unfortunately, constructing, or synthesising, such policies is a challenging task. Previous approaches do so by imitating a neural-network policy, approximating a tabular policy obtained via formal synthesis, employing reinforcement learning, or modelling the problem as a mixed-integer linear program. However, these works may require access to a hard-to-obtain accurate policy or a formal model of the environment (within reach of formal synthesis), and may not provide guarantees on the quality or size of the final tree policy. In contrast, we present an approach to synthesise optimal decision-tree policies given a deterministic black-box environment and specification, a discretisation of the tree predicates, and an initial set of states, where optimality is defined with respect to the number of steps to achieve the goal. Our approach is a specialised search algorithm which systematically explores the (exponentially large) space of decision trees under the given discretisation. The key component is a novel trace-based pruning mechanism that significantly reduces the search space. Our approach represents a conceptually novel way of synthesising small decision-tree policies with optimality guarantees even for black-box environments with black-box specifications.

Problem

Research questions and friction points this paper is trying to address.

Black-box Systems

Decision Tree Optimization

Dynamic System Control

Innovation

Methods, ideas, or system contributions that make the work stand out.

Black-box Systems

Optimal Decision Tree Generation

Efficient Search Algorithm

🔎 Similar Papers

SYMPOL: Symbolic Tree-Based On-Policy Reinforcement Learning