🤖 AI Summary
Classical deep reinforcement learning (DRL) methods struggle to consistently outperform heuristic policies, such as the capped base-stock policy, in inventory optimization under dynamic and uncertain demand, largely because they do not explicitly model system stochasticity or physical constraints.
Method: This paper proposes the first end-to-end inventory control framework integrating controlled stochastic differential equations (SDEs) with deep neural networks. It models demand and inventory dynamics in continuous time, leverages stochastic optimal control theory to ensure policy interpretability, and employs gradient-augmented policy optimization for data-driven adaptability.
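The continuous-time modeling idea can be illustrated with a minimal sketch: simulate a controlled inventory SDE via Euler–Maruyama, where a replenishment policy acts as the control. All names, dynamics, and parameters below are illustrative assumptions, not the paper's actual model; in the paper the policy would be a neural network trained by gradient-augmented policy optimization rather than the simple proportional rule used here.

```python
import numpy as np

# Hypothetical controlled inventory SDE (illustrative, not from the paper):
#   dX_t = (u(X_t) - mu_D) dt + sigma_D dW_t
# X_t: inventory level, u(.): replenishment-rate policy,
# mu_D / sigma_D: mean demand rate and demand volatility.

rng = np.random.default_rng(0)

mu_D, sigma_D = 5.0, 2.0   # demand drift and volatility (assumed values)
dt, T = 0.01, 10.0         # Euler-Maruyama step size and horizon
n_steps = int(T / dt)

def policy(x, target=20.0, gain=0.5):
    # Base-stock-like proportional control: order faster when inventory
    # falls below the target; never order a negative amount.
    return max(gain * (target - x), 0.0)

x = 20.0                   # initial inventory
path = [x]
for _ in range(n_steps):
    drift = policy(x) - mu_D
    # Euler-Maruyama update: drift term plus Brownian increment
    x += drift * dt + sigma_D * np.sqrt(dt) * rng.standard_normal()
    path.append(x)

avg_inventory = float(np.mean(path))
```

A framework like the one summarized would replace `policy` with a trained network and differentiate the expected cost (holding plus stockout) through the simulated dynamics.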
Contribution/Results: Evaluated across diverse supply chain simulation environments, the framework significantly reduces stockout rates and holding costs. It achieves an average total cost reduction of 18.7% over state-of-the-art DRL baselines (SAC and DQN), demonstrating superior robustness, interpretability, and empirical performance under uncertainty.