Efficient Strategy Synthesis for MDPs via Hierarchical Block Decomposition

📅 2025-06-21

📈 Citations: 0

✨ Influential: 0

career value

230K/year

🤖 AI Summary

To address the poor scalability of conventional policy synthesis methods for large-scale Markov decision processes (MDPs), this paper proposes a vulnerability-driven hierarchical block decomposition approach. The method iteratively refines the model dynamically and selects regions based on uncertainty awareness, focusing computational effort exclusively on the currently most vulnerable state subsets for fine-grained modeling and optimization—thereby jointly improving accuracy and efficiency. Its core innovation lies in recasting policy synthesis as an incremental refinement process targeted at critical uncertain regions, circumventing prohibitively expensive global computations. Experiments on MDP benchmarks with over one million states demonstrate that our approach achieves up to a 2× speedup over the state-of-the-art tool PRISM, significantly enhancing the feasibility and practicality of policy synthesis for large-scale systems.

Technology Category

Application Category

📝 Abstract

Software-intensive systems, such as software product lines and robotics, utilise Markov decision processes (MDPs) to capture uncertainty and analyse sequential decision-making problems. Despite the usefulness of conventional policy synthesis methods, they fail to scale to large state spaces. Our approach addresses this issue and accelerates policy synthesis in large MDPs by dynamically refining the MDP and iteratively selecting the most fragile MDP regions for refinement. This iterative procedure offers a balance between accuracy and efficiency, as refinement occurs only when necessary. Through a comprehensive empirical evaluation comprising diverse case studies and MDPs up to 1M states, we demonstrate significant performance improvements yielded by our approach compared to the leading probabilistic model checker PRISM (up to 2x), thus offering a very competitive solution for real-world policy synthesis tasks in larger MDPs.

Problem

Research questions and friction points this paper is trying to address.

Scaling policy synthesis for large MDP state spaces

Improving efficiency via dynamic MDP refinement

Balancing accuracy and speed in iterative refinement

Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical block decomposition for MDPs

Dynamic refinement of fragile regions

Iterative balance between accuracy and efficiency

🔎 Similar Papers

Synthesis of Hierarchical Controllers Based on Deep Reinforcement Learning Policies