RESCHED: Rethinking Flexible Job Shop Scheduling from a Transformer-based Architecture with Simplified States

📅 2026-03-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limited generalization capability of existing approaches to the flexible job shop scheduling problem (FJSP), which often rely on complex handcrafted features or graph neural networks. To overcome this challenge, the authors propose a lightweight deep reinforcement learning framework. By reformulating the FJSP as a Markov decision process with a state space reduced to four core features, they design a scheduling-oriented, lightweight Transformer architecture incorporating a subproblem-aware state representation, dot-product attention mechanism, and three targeted structural enhancements. The resulting method substantially reduces modeling complexity, outperforms conventional dispatching rules and current deep reinforcement learning approaches on FJSP, and achieves performance comparable to specialized neural models on related variants such as the job shop scheduling problem (JSSP) and the flexible flow shop scheduling problem (FFSP), demonstrating strong cross-problem generalization.

Technology Category

Application Category

📝 Abstract
Neural approaches to the Flexible Job Shop Scheduling Problem (FJSP), particularly those based on deep reinforcement learning (DRL), have gained growing attention in recent years. However, existing methods rely on complex feature-engineered state representations (i.e., often requiring more than 20 handcrafted features) and graph-biased neural architectures. To reduce modeling complexity and advance a more generalizable framework for FJSP, we introduce \textsc{ReSched}, a minimalist DRL framework that rethinks both the scheduling formulation and model design. First, by revisiting the Markov Decision Process (MDP) formulation of FJSP, we condense the state space to just four essential features, eliminating historical dependencies through a subproblem-based perspective. Second, we employ Transformer blocks with dot-product attention, augmented by three lightweight but effective architectural modifications tailored to scheduling tasks. Extensive experiments show that \textsc{ReSched} outperforms classical dispatching rules and state-of-the-art DRL methods on FJSP. Moreover, \textsc{ReSched} also generalizes well to the Job Shop Scheduling Problem (JSSP) and the Flexible Flow Shop Scheduling Problem (FFSP), achieving competitive performance against neural baselines specifically designed for these variants.
Problem

Research questions and friction points this paper is trying to address.

Flexible Job Shop Scheduling Problem
deep reinforcement learning
state representation
model complexity
generalization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Flexible Job Shop Scheduling
Deep Reinforcement Learning
Transformer Architecture
State Simplification
Generalizable Scheduling
🔎 Similar Papers
No similar papers found.
X
Xiangjie Xiao
School of Computing and Information Systems, Singapore Management University, Singapore
C
Cong Zhang
College of Computing and Data Science, Nanyang Technological University, Singapore
Wen Song
Wen Song
Shandong University
Artificial IntelligenceDeep Reinforcement LearningPlanning and SchedulingMulti-agent SystemsCombinatorial Optimization
Zhiguang Cao
Zhiguang Cao
Singapore Management University
Learning to OptimizeNeural Combinatorial OptimizationComputational Intelligence