LightAutoDS-Tab: Multi-AutoML Agentic System for Tabular Data

📅 2025-07-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing AutoML systems for tabular data tasks suffer from limited flexibility and robustness due to over-reliance on a single underlying tool. This work proposes a collaborative multi-AutoML agent framework that integrates LLM-driven code generation, dynamic multi-engine scheduling, and task-feedback-guided iterative optimization to achieve end-to-end automated modeling. By decoupling the system from fixed tool dependencies, the architecture significantly enhances adaptability and fault tolerance in challenging scenarios—including heterogeneous data, noisy inputs, and small-sample regimes. Evaluated on multiple Kaggle benchmark tasks, our approach surpasses current open-source state-of-the-art methods, achieving an average performance gain of 3.2% while demonstrating superior generalization and stability. The implementation is publicly available.

Technology Category

Application Category

📝 Abstract
AutoML has advanced in handling complex tasks using the integration of LLMs, yet its efficiency remains limited by dependence on specific underlying tools. In this paper, we introduce LightAutoDS-Tab, a multi-AutoML agentic system for tasks with tabular data, which combines an LLM-based code generation with several AutoML tools. Our approach improves the flexibility and robustness of pipeline design, outperforming state-of-the-art open-source solutions on several data science tasks from Kaggle. The code of LightAutoDS-Tab is available in the open repository https://github.com/sb-ai-lab/LADS
Problem

Research questions and friction points this paper is trying to address.

Enhances AutoML flexibility for tabular data tasks
Integrates LLM-based code generation with AutoML tools
Improves pipeline design robustness over existing solutions
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-based code generation for AutoML
Integration of multiple AutoML tools
Enhanced pipeline flexibility and robustness
🔎 Similar Papers
A
Aleksey Lapin
ITMO University
I
Igor Hromov
Sber AI Lab
S
Stanislav Chumakov
ITMO University
Mile Mitrovic
Mile Mitrovic
Sber AI Lab
Machine LearningDeep LearningOptimizationAlgorithms
Dmitry Simakov
Dmitry Simakov
Sber AI Lab
data science
N
Nikolay O. Nikitin
ITMO University
A
Andrey V. Savchenko
Sber AI Lab