GENIUS: An Agentic AI Framework for Autonomous Design and Execution of Simulation Protocols

📅 2025-12-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Non-expert users face significant barriers in efficiently utilizing first-principles simulation codes for materials modeling. Method: This paper proposes an AI agent framework integrating knowledge graphs, hierarchical large language models (LLMs), and finite-state machines to enable end-to-end automatic translation of natural-language instructions into executable simulation input files, while supporting protocol design, validation, and self-healing of errors. Contribution/Results: The framework introduces a knowledge-augmented reasoning architecture and a state-driven hallucination suppression mechanism—enabling, for the first time, autonomous simulation protocol generation and closed-loop error correction. Evaluated on 295 benchmark tasks, it achieves an 80% task success rate, 76% autonomous error repair rate, and reduces failure rate to 7%. Moreover, inference cost is halved compared to pure-LLM approaches, substantially advancing Integrated Computational Materials Engineering (ICME) toward low-barrier, high-reliability deployment.

Technology Category

Application Category

📝 Abstract
Predictive atomistic simulations have propelled materials discovery, yet routine setup and debugging still demand computer specialists. This know-how gap limits Integrated Computational Materials Engineering (ICME), where state-of-the-art codes exist but remain cumbersome for non-experts. We address this bottleneck with GENIUS, an AI-agentic workflow that fuses a smart Quantum ESPRESSO knowledge graph with a tiered hierarchy of large language models supervised by a finite-state error-recovery machine. Here we show that GENIUS translates free-form human-generated prompts into validated input files that run to completion on $approx$80% of 295 diverse benchmarks, where 76% are autonomously repaired, with success decaying exponentially to a 7% baseline. Compared with LLM-only baselines, GENIUS halves inference costs and virtually eliminates hallucinations. The framework democratizes electronic-structure DFT simulations by intelligently automating protocol generation, validation, and repair, opening large-scale screening and accelerating ICME design loops across academia and industry worldwide.
Problem

Research questions and friction points this paper is trying to address.

Automates simulation setup for non-experts
Reduces reliance on specialists for materials engineering
Enhances accuracy and efficiency in computational workflows
Innovation

Methods, ideas, or system contributions that make the work stand out.

AI-agentic workflow automates simulation setup
Knowledge graph and LLMs enable autonomous repair
Finite-state machine reduces hallucinations and costs
🔎 Similar Papers
No similar papers found.
M
Mohammad Soleymanibrojeni
Karlsruhe Institute of Technology, Institute of Nanotechnology Hermann-von-Helmholtz-Platz, Karlsruhe, 76021, Germany
Roland Aydin
Roland Aydin
Professor at Hamburg University of Technology, Germany
Large Language ModelsMachine LearningMaterials Science
D
Diego Guedes-Sobrinho
Federal University of Paraná, Department of Chemistry, Curitiba, 81531-980, Brazil
A
Alexandre C. Dias
University of Brasília, Institute of Physics and International Center of Physics, Brasília, 70919-970, Brazil
M
Maurício J. Piotrowski
Federal University of Pelotas, Department of Physics, Pelotas, 96010-900, Brazil
W
Wolfgang Wenzel
Karlsruhe Institute of Technology, Institute of Nanotechnology Hermann-von-Helmholtz-Platz, Karlsruhe, 76021, Germany
C
Celso Ricardo Caldeira Rêgo
Karlsruhe Institute of Technology, Institute of Nanotechnology Hermann-von-Helmholtz-Platz, Karlsruhe, 76021, Germany