🤖 AI Summary
This paper addresses a class of multistage stochastic programming problems characterized by continuous state and action spaces, decision-dependent uncertainty, and a limited form of statistical learning. To overcome the expressive limitations of conventional models, we extend the policy graph framework so that one-step transition probabilities may depend on decisions, explicitly capturing the feedback effect of decisions on uncertainty, and we incorporate a limited learning mechanism. Building on this model, we develop new variants of stochastic dual dynamic programming (SDDP), including an approximation scheme for the non-convexities this structure introduces, tailored to structured Markov decision processes with continuous state and action spaces. We illustrate the expressiveness of the modeling approach with a series of examples of increasing complexity. The work integrates statistical learning with sequential decision-making within a single stochastic optimization framework, aiming at both enhanced expressiveness and tractability.
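To make the decision-dependent transition idea concrete, here is a minimal sketch in Python. All names (`Node`, `transition`, `sample_successor`) and the toy probabilities are our own illustration of the concept, not the paper's notation or implementation: a node's outgoing transition probabilities are a function of the action taken at that node.

```python
import random
from typing import Callable, Dict

# Illustrative sketch (made-up names, not the paper's code): a policy-graph
# node whose one-step transition probabilities depend on the decision
# taken at it -- the "decision-dependent uncertainty" described above.

class Node:
    def __init__(self, name: str,
                 transition: Callable[[float], Dict[str, float]]):
        self.name = name
        # Maps the action taken at this node to a probability
        # distribution over successor node names.
        self.transition = transition

def sample_successor(node: Node, action: float) -> str:
    dist = node.transition(action)
    names, weights = zip(*dist.items())
    return random.choices(names, weights=weights, k=1)[0]

# Toy feedback effect: a larger action (e.g. more investment) shifts
# probability mass toward the "good" successor. Valid for action in [0, 0.8].
root = Node("root", lambda a: {"good": 0.2 + a, "bad": 0.8 - a})
print(sample_successor(root, action=0.5))  # "good" with probability 0.7
```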
📝 Abstract
We study a class of multi-stage stochastic programs, which incorporate modeling features from Markov decision processes (MDPs). This class includes structured MDPs with continuous state and action spaces. We extend policy graphs to include decision-dependent uncertainty for one-step transition probabilities as well as a limited form of statistical learning. We focus on the expressiveness of our modeling approach, illustrating ideas with a series of examples of increasing complexity. As a solution method, we develop new variants of stochastic dual dynamic programming, including approximations to handle non-convexities.
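As background on the solution method, the sketch below illustrates the cutting-plane machinery at the heart of SDDP-style algorithms: a convex cost-to-go function is replaced by an outer approximation, the pointwise maximum of affine cuts collected on backward passes. The `CutPool` class and the toy quadratic value function are our own simplified illustration under those assumptions, not code or notation from the paper.

```python
from typing import List, Tuple

# Sketch of SDDP's outer approximation of a convex cost-to-go function:
# V(x) is under-approximated by max_k (alpha_k + beta_k * x).

class CutPool:
    def __init__(self) -> None:
        self.cuts: List[Tuple[float, float]] = []  # (intercept, slope)

    def add_cut(self, value: float, slope: float, x: float) -> None:
        # A cut tangent at x: V(y) >= value + slope * (y - x).
        self.cuts.append((value - slope * x, slope))

    def evaluate(self, x: float) -> float:
        # Lower bound from the pointwise max of all cuts so far.
        return max((a + b * x for a, b in self.cuts), default=0.0)

# Toy convex cost-to-go V(x) = (x - 1)**2; each "backward pass"
# adds the tangent cut at a sampled state.
pool = CutPool()
for x in (0.0, 0.5, 2.0):
    pool.add_cut(value=(x - 1) ** 2, slope=2 * (x - 1), x=x)
print(pool.evaluate(1.0))  # -0.25, a valid underestimate of V(1) = 0
```

When the underlying problem is non-convex, as with the decision-dependent transitions above, this pointwise-maximum-of-cuts representation is no longer exact, which is why the paper develops approximation variants.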