🤖 AI Summary
Existing combinatorial optimization (CO) benchmarks predominantly rely on small-scale synthetic data, lacking systematic evaluation of machine learning (ML) solvers’ effectiveness and scalability on real-world, large-scale industrial instances, and suffer from severe training data scarcity. To address this, we introduce FrontierCO—the first unified benchmark covering eight classical CO problem classes, featuring industrially relevant instance scales and abundant high-quality labeled training data. We conduct the first comprehensive empirical evaluation of 16 state-of-the-art ML solvers—including graph neural networks (GNNs) and large language model (LLM)-based agents—within a unified end-to-end modeling, training, and evaluation framework. Our experiments uncover critical bottlenecks in cross-scale generalization, computational scalability, and practical deployability of current approaches. All benchmark data, source code, and trained models are publicly released on Hugging Face to ensure full reproducibility and community accessibility.
📝 Abstract
Machine learning (ML) has demonstrated considerable potential in supporting model design and optimization for combinatorial optimization (CO) problems. However, much of the progress to date has been evaluated on small-scale, synthetic datasets, raising concerns about the practical effectiveness of ML-based solvers in real-world, large-scale CO scenarios. Additionally, many existing CO benchmarks lack sufficient training data, limiting their utility for evaluating data-driven approaches. To address these limitations, we introduce FrontierCO, a comprehensive benchmark that covers eight canonical CO problem types and evaluates 16 representative ML-based solvers--including graph neural networks and large language model (LLM) agents. FrontierCO features challenging instances drawn from industrial applications and frontier CO research, offering both realistic problem difficulty and abundant training data. Our empirical results provide critical insights into the strengths and limitations of current ML methods, helping to guide more robust and practically relevant advances at the intersection of machine learning and combinatorial optimization. Our data is available at https://huggingface.co/datasets/CO-Bench/FrontierCO.