NNGPT: Rethinking AutoML with Large Language Models

📅 2025-11-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current AI systems lack sufficient self-evolution capability, particularly in automated neural architecture design for computer vision. Method: This paper proposes the first AutoML framework integrating large language models (LLMs) with a self-evolution mechanism for end-to-end neural network automation in computer vision. It establishes a “generate–evaluate–improve” closed loop and introduces NN-RAG—a retrieval-augmented generation module for executable PyTorch code—enabling framework-agnostic modeling. The framework further incorporates zero-shot architecture generation, code-aware accuracy prediction, hash-based deduplication, and lightweight PyTorch adapters to enhance reproducibility and efficiency. Contributions/Results: The system has validated over 5,000 models; NN-RAG achieves a 73% executable code generation rate; hyperparameter optimization yields an RMSE of 0.60—outperforming Optuna; code-aware accuracy prediction achieves an RMSE of 0.14 and a Pearson correlation coefficient of 0.78.

Technology Category

Application Category

📝 Abstract
Building self-improving AI systems remains a fundamental challenge in the AI domain. We present NNGPT, an open-source framework that turns a large language model (LLM) into a self-improving AutoML engine for neural network development, primarily for computer vision. Unlike previous frameworks, NNGPT extends the dataset of neural networks by generating new models, enabling continuous fine-tuning of LLMs based on closed-loop system of generation, assessment, and self-improvement. It integrates within one unified workflow five synergistic LLM-based pipelines: zero-shot architecture synthesis, hyperparameter optimization (HPO), code-aware accuracy/early-stop prediction, retrieval-augmented synthesis of scope-closed PyTorch blocks (NN-RAG), and reinforcement learning. Built on the LEMUR dataset as an audited corpus with reproducible metrics, NNGPT emits from a single prompt and validates network architecture, preprocessing code, and hyperparameters, executes them end-to-end, and learns from result. The PyTorch adapter makes NNGPT framework-agnostic, enabling strong performance: NN-RAG achieves 73% executability on 1,289 targets, 3-shot prompting boosts accuracy on common datasets, and hash-based deduplication saves hundreds of runs. One-shot prediction matches search-based AutoML, reducing the need for numerous trials. HPO on LEMUR achieves RMSE 0.60, outperforming Optuna (0.64), while the code-aware predictor reaches RMSE 0.14 with Pearson r=0.78. The system has already generated over 5K validated models, proving NNGPT as an autonomous AutoML engine. Upon acceptance, the code, prompts, and checkpoints will be released for public access to enable reproducibility and facilitate community usage.
Problem

Research questions and friction points this paper is trying to address.

Building self-improving AI systems for neural network development
Automating neural architecture synthesis and hyperparameter optimization
Creating continuous learning AutoML through generation-assessment cycles
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-based AutoML engine for neural network development
Closed-loop system with generation, assessment, self-improvement
Unified workflow integrating five synergistic LLM pipelines
🔎 Similar Papers
R
Roman Kochnev
Computer Vision Lab, CAIDAS & IFI, University of Ẅurzburg, Germany
W
Waleed Khalid
Computer Vision Lab, CAIDAS & IFI, University of Ẅurzburg, Germany
T
Tolgay Atinc Uzun
Computer Vision Lab, CAIDAS & IFI, University of Ẅurzburg, Germany
X
Xi Zhang
Computer Vision Lab, CAIDAS & IFI, University of Ẅurzburg, Germany
Y
Yashkumar Sanjaybhai Dhameliya
Computer Vision Lab, CAIDAS & IFI, University of Ẅurzburg, Germany
F
Furui Qin
Computer Vision Lab, CAIDAS & IFI, University of Ẅurzburg, Germany
C
Chandini Vysyaraju
Computer Vision Lab, CAIDAS & IFI, University of Ẅurzburg, Germany
R
Raghuvir Duvvuri
Computer Vision Lab, CAIDAS & IFI, University of Ẅurzburg, Germany
A
Avi Goyal
Computer Vision Lab, CAIDAS & IFI, University of Ẅurzburg, Germany
Dmitry Ignatov
Dmitry Ignatov
Associate Professor, MMCP Lab Head, Computer Science Faculty, Higher School of Economics
Data MiningMachine LearningFormal Concept AnalysisAIInformation Retrieval
Radu Timofte
Radu Timofte
Humboldt Professor for AI and Computer Vision, University of Würzburg
Computer VisionMachine LearningAICompressionComputational Photography