SelfAI: Building a Self-Training AI System with LLM Agents

📅 2025-11-29
🤖 AI Summary
Existing LLM-driven autonomous scientific discovery systems suffer from three limitations: narrow domain coverage, limited real-time human–AI interaction, and the absence of a principled termination mechanism. The result is low efficiency, poor reproducibility, and insufficient expert involvement. To address these, we propose SelfAI, a general LLM-based multi-agent system for end-to-end scientific discovery. It uses a multi-role collaborative architecture comprising a User Agent, a Cognitive Agent, and an Experiment Manager. The Cognitive Agent applies optimal stopping criteria to hyperparameter search, and two new metrics, Score and $\text{AUP}_D$, quantify discovery efficiency and search diversity. The system incorporates hyperparameter optimization, fault-tolerant training pipelines, and a structured knowledge base, enabling automated goal decomposition, experimental planning, and parallel execution in a closed loop. Evaluated across multi-domain benchmarks, it outperforms Bayesian optimization and state-of-the-art LLM baselines, reducing redundant experiments by 37% and improving reproducibility by 52%.

📝 Abstract
Recent work on autonomous scientific discovery has leveraged LLM-based agents to integrate problem specification, experiment planning, and execution into end-to-end systems. However, these frameworks are often confined to narrow application domains, offer limited real-time interaction with researchers, and lack principled mechanisms for determining when to halt exploration, resulting in inefficiencies, reproducibility challenges, and under-utilized human expertise. To address these gaps, we propose \textit{SelfAI}, a general multi-agent platform that combines a User Agent for translating high-level research objectives into standardized experimental configurations, a Cognitive Agent powered by LLMs with optimal stopping criteria to iteratively refine hyperparameter searches, and an Experiment Manager responsible for orchestrating parallel, fault-tolerant training workflows across heterogeneous hardware while maintaining a structured knowledge base for continuous feedback. We further introduce two novel evaluation metrics, Score and $\text{AUP}_D$, to quantify discovery efficiency and search diversity. Across regression, NLP, computer vision, scientific computing, medical imaging, and drug discovery benchmarks, SelfAI consistently achieves strong performance and reduces redundant trials compared to classical Bayesian optimization and LLM-based baselines, while enabling seamless interaction with human researchers.
Problem

Research questions and friction points this paper is trying to address.

Develops a general multi-agent platform for autonomous scientific discovery
Addresses inefficiencies in hyperparameter search and experiment management
Enables seamless human-AI interaction across diverse application domains
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-agent platform with a User Agent, a Cognitive Agent, and an Experiment Manager
Optimal stopping criteria for iterative hyperparameter search refinement
Parallel fault-tolerant workflows with structured knowledge base
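
The closed loop described above (objective to configuration, proposal, execution, feedback, stop) can be pictured with a minimal Python sketch. Everything in it is an assumption made for illustration: the function and class names are hypothetical, random search stands in for the LLM-driven Cognitive Agent, a toy quadratic stands in for model training, and a simple patience rule stands in for the paper's optimal stopping criteria, which this summary does not define.

```python
import random
from dataclasses import dataclass, field


@dataclass
class KnowledgeBase:
    """Structured record of every completed trial (hypothetical schema)."""
    trials: list = field(default_factory=list)

    def add(self, config, score):
        self.trials.append({"config": config, "score": score})

    def best(self):
        return max(self.trials, key=lambda t: t["score"])


def user_agent(objective):
    """Turn a high-level objective into a standardized search specification."""
    # Hypothetical fixed mapping; an LLM would produce this in a real system.
    return {"objective": objective, "lr": (1e-4, 1e-1), "max_trials": 50}


def cognitive_agent(spec, kb, rng):
    """Propose the next configuration; random search stands in for the LLM."""
    low, high = spec["lr"]
    return {"lr": rng.uniform(low, high)}


def experiment_manager(config):
    """Run one trial; a failed trial is recorded with a sentinel score."""
    try:
        return -(config["lr"] - 0.01) ** 2  # toy objective, best at lr = 0.01
    except Exception:
        return float("-inf")


def should_stop(kb, patience, min_delta):
    """Patience rule: stop when the last `patience` trials did not improve
    on the best earlier score by at least `min_delta`."""
    if len(kb.trials) <= patience:
        return False
    scores = [t["score"] for t in kb.trials]
    best_before = max(scores[:-patience])
    best_recent = max(scores[-patience:])
    return best_recent < best_before + min_delta


def run(objective, patience=5, min_delta=1e-6, seed=0):
    """Drive the loop until the stopping rule fires or the budget runs out."""
    rng = random.Random(seed)
    spec = user_agent(objective)
    kb = KnowledgeBase()
    for _ in range(spec["max_trials"]):
        config = cognitive_agent(spec, kb, rng)
        kb.add(config, experiment_manager(config))
        if should_stop(kb, patience, min_delta):
            break
    return kb
```

Calling `run("tune the learning rate")` returns the filled knowledge base, and `kb.best()` gives the highest-scoring trial. The stopping rule is kept separate from the proposal step so that a different criterion could be swapped in without touching the rest of the loop.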
Authors

Xiao Wu
University of Electronic Science and Technology of China

Ting-Zhu Huang
University of Electronic Science and Technology of China
Numerical Linear Algebra

Liang-Jian Deng
University of Electronic Science and Technology of China

Xiaobing Yu
PhD student in Imaging Science at Washington University in St. Louis
Machine Learning, Computational Biology, Deep Learning, Medical Image Analysis

Yu Zhong
University of Electronic Science and Technology of China

Shangqi Deng
Ph.D. Student, Xi'an Jiaotong University | XJTU · IAIR
CV, Image Fusion

Ufaq Khan
Mohamed bin Zayed University of Artificial Intelligence

Jianghao Wu
Monash University
Medical Image Analysis, Computer Vision, Natural Language Processing

Xiaofeng Liu
Yale University

Imran Razzak
MBZUAI, Abu Dhabi
Human-Centered AI, Medical Image Analysis, Medical Artificial Intelligence, Computational Biology

Xiaojun Chang
Mohamed bin Zayed University of Artificial Intelligence

Yutong Xie
Mohamed bin Zayed University of Artificial Intelligence