Mamba Meets Scheduling: Learning to Solve Flexible Job Shop Scheduling with Efficient Sequence Modeling

📅 2026-02-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of efficiently modeling global dependencies between operations and machines in the flexible job shop scheduling problem (FJSP), where existing learning-based methods often fall short. To this end, we introduce the Mamba state space model—characterized by linear computational complexity—into FJSP for the first time. We propose a dual-branch Mamba encoder to separately capture sequential features of operations and machines, coupled with a lightweight cross-attention decoder to model their interactions for optimizing scheduling objectives such as makespan. Compared to conventional graph attention-based approaches, our method substantially reduces computational overhead while achieving faster solution times and superior performance across multiple FJSP benchmarks, outperforming current state-of-the-art learning-based scheduling algorithms.

Technology Category

Application Category

📝 Abstract
The Flexible Job Shop Problem (FJSP) is a well-studied combinatorial optimization problem with extensive applications for manufacturing and production scheduling. It involves assigning jobs to various machines to optimize criteria, such as minimizing total completion time. Current learning-based methods in this domain often rely on localized feature extraction models, limiting their capacity to capture overarching dependencies spanning operations and machines. This paper introduces an innovative architecture that harnesses Mamba, a state-space model with linear computational complexity, to facilitate comprehensive sequence modeling tailored for FJSP. In contrast to prevalent graph-attention-based frameworks that are computationally intensive for FJSP, we show our model is more efficient. Specifically, the proposed model possesses an encoder and a decoder. The encoder incorporates a dual Mamba block to extract operation and machine features separately. Additionally, we introduce an efficient cross-attention decoder to learn interactive embeddings of operations and machines. Our experimental results demonstrate that our method achieves faster solving speed and surpasses the performance of state-of-the-art learning-based methods for FJSP across various benchmarks.
Problem

Research questions and friction points this paper is trying to address.

Flexible Job Shop Scheduling
Combinatorial Optimization
Sequence Modeling
Global Dependencies
Manufacturing Scheduling
Innovation

Methods, ideas, or system contributions that make the work stand out.

Mamba
Flexible Job Shop Scheduling
State-Space Model
Sequence Modeling
Cross-Attention Decoder
🔎 Similar Papers
No similar papers found.
Z
Zhi Cao
Dalian University of Technology
C
Cong Zhang
Nanyang Technological University
Yaoxin Wu
Yaoxin Wu
Eindhoven University of Technology
Deep learningCombinatorial optimizationInteger programmingMulti-objective optimization
Y
Yaqing Hou
Dalian University of Technology
H
Hongwei Ge
Dalian University of Technology