Addressing Terminal Constraints in Data-Driven Demand Response Scheduling

📅 2026-05-14

📈 Citations: 0

✨ Influential: 0

career value

163K/year

🤖 AI Summary

This work addresses the challenges in data-driven demand response scheduling, where terminal constraints are difficult to satisfy over long time horizons, conventional model-based optimization incurs high computational costs, and reinforcement learning is often hindered by myopic behavior and credit assignment issues. To overcome these limitations, the paper proposes an approach that integrates Goal Space Planning (GSP) with Deep Deterministic Policy Gradient (DDPG). By constructing a temporally abstract model over discrete subgoals, the method effectively propagates long-horizon value signals, thereby alleviating credit assignment difficulties and enhancing adherence to terminal constraints. Experimental results on an air separation simulation benchmark demonstrate that the proposed method significantly outperforms standard DDPG, achieving substantial improvements in both sample efficiency and constraint satisfaction.

📝 Abstract

Electrified chemical processes are incentivized by exposure to time-varying electricity markets to operate flexibly, but participating in demand response schemes can require satisfying terminal constraints over long horizons. Specifically, terminal constraints may be required when computing optimal schedules in order to preserve dynamic stability. Model-based optimization methods are computationally costly, and data-driven scheduling via reinforcement learning (RL) faces severe credit-assignment challenges. We integrate Goal-Space Planning (GSP) with Deep Deterministic Policy Gradient (DDPG), using learned temporally abstract models over discrete subgoals to propagate value across extended horizons. Using a simulated air separation benchmark, we demonstrate the proposed approach improves sample efficiency over standard DDPG while satisfying terminal storage constraints, mitigating myopic control behavior.

Problem

Research questions and friction points this paper is trying to address.

terminal constraints

demand response scheduling

dynamic stability

sample efficiency

credit assignment

Innovation

Methods, ideas, or system contributions that make the work stand out.

Goal-Space Planning

Deep Deterministic Policy Gradient

Terminal Constraints