OMGPT: A Sequence Modeling Framework for Data-driven Operational Decision Making

📅 2025-05-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses sequential decision-making problems in operations research—including dynamic pricing, inventory management, resource allocation, and queue control—by introducing OMGPT, the first end-to-end generative pre-trained Transformer tailored for operations decisions. Unlike conventional approaches relying on analytical models or task-specific heuristics, OMGPT frames the mapping from historical system states to optimal actions as a unified sequence-to-sequence prediction task, enabling model-free, purely data-driven generalization. Methodologically, it integrates the Transformer architecture with Bayesian theoretical analysis and large-scale pretraining with transfer learning. Extensive experiments demonstrate that OMGPT significantly outperforms existing baselines across diverse operations tasks, exhibiting strong cross-task generalization and robustness to environmental variations. It establishes a scalable, transferable paradigm for operations decision-making, bridging deep learning and classical OR without requiring domain-specific modeling assumptions.

Technology Category

Application Category

📝 Abstract
We build a Generative Pre-trained Transformer (GPT) model from scratch to solve sequential decision making tasks arising in contexts of operations research and management science which we call OMGPT. We first propose a general sequence modeling framework to cover several operational decision making tasks as special cases, such as dynamic pricing, inventory management, resource allocation, and queueing control. Under the framework, all these tasks can be viewed as a sequential prediction problem where the goal is to predict the optimal future action given all the historical information. Then we train a transformer-based neural network model (OMGPT) as a natural and powerful architecture for sequential modeling. This marks a paradigm shift compared to the existing methods for these OR/OM tasks in that (i) the OMGPT model can take advantage of the huge amount of pre-trained data; (ii) when tackling these problems, OMGPT does not assume any analytical model structure and enables a direct and rich mapping from the history to the future actions. Either of these two aspects, to the best of our knowledge, is not achieved by any existing method. We establish a Bayesian perspective to theoretically understand the working mechanism of the OMGPT on these tasks, which relates its performance with the pre-training task diversity and the divergence between the testing task and pre-training tasks. Numerically, we observe a surprising performance of the proposed model across all the above tasks.
Problem

Research questions and friction points this paper is trying to address.

Develops OMGPT for sequential operational decision-making tasks
Proposes a unified framework for dynamic pricing, inventory, and resource tasks
Uses transformer models without assuming analytical structures
Innovation

Methods, ideas, or system contributions that make the work stand out.

Develops a GPT model for operational decision-making tasks
Uses transformer-based neural network for sequential modeling
Leverages pre-trained data without assuming analytical model structure
🔎 Similar Papers
No similar papers found.
Hanzhao Wang
Hanzhao Wang
University of Sydney
G
Guanting Chen
Department of Statistics and Operations Research, University of North Carolina at Chapel Hill
K
K. Talluri
Imperial College Business School, Imperial College London
Xiaocheng Li
Xiaocheng Li
Imperial College Business School, Imperial College London
Machine learningoperations research