GENIUS: An Agentic AI Framework for Autonomous Design and Execution of Simulation Protocols

📅 2025-12-06

📈 Citations: 0

✨ Influential: 0

career value

237K/year

🤖 AI Summary

Non-expert users face significant barriers in efficiently utilizing first-principles simulation codes for materials modeling. Method: This paper proposes an AI agent framework integrating knowledge graphs, hierarchical large language models (LLMs), and finite-state machines to enable end-to-end automatic translation of natural-language instructions into executable simulation input files, while supporting protocol design, validation, and self-healing of errors. Contribution/Results: The framework introduces a knowledge-augmented reasoning architecture and a state-driven hallucination suppression mechanism—enabling, for the first time, autonomous simulation protocol generation and closed-loop error correction. Evaluated on 295 benchmark tasks, it achieves an 80% task success rate, 76% autonomous error repair rate, and reduces failure rate to 7%. Moreover, inference cost is halved compared to pure-LLM approaches, substantially advancing Integrated Computational Materials Engineering (ICME) toward low-barrier, high-reliability deployment.

Technology Category

Application Category

📝 Abstract

Predictive atomistic simulations have propelled materials discovery, yet routine setup and debugging still demand computer specialists. This know-how gap limits Integrated Computational Materials Engineering (ICME), where state-of-the-art codes exist but remain cumbersome for non-experts. We address this bottleneck with GENIUS, an AI-agentic workflow that fuses a smart Quantum ESPRESSO knowledge graph with a tiered hierarchy of large language models supervised by a finite-state error-recovery machine. Here we show that GENIUS translates free-form human-generated prompts into validated input files that run to completion on $approx$80% of 295 diverse benchmarks, where 76% are autonomously repaired, with success decaying exponentially to a 7% baseline. Compared with LLM-only baselines, GENIUS halves inference costs and virtually eliminates hallucinations. The framework democratizes electronic-structure DFT simulations by intelligently automating protocol generation, validation, and repair, opening large-scale screening and accelerating ICME design loops across academia and industry worldwide.

Problem

Research questions and friction points this paper is trying to address.

Automates simulation setup for non-experts

Reduces reliance on specialists for materials engineering

Enhances accuracy and efficiency in computational workflows

Innovation

Methods, ideas, or system contributions that make the work stand out.

AI-agentic workflow automates simulation setup

Knowledge graph and LLMs enable autonomous repair

Finite-state machine reduces hallucinations and costs

🔎 Similar Papers

Toward Automated Simulation Research Workflow through LLM Prompt Engineering Design.

2024-08-28Journal of Chemical Information and ModelingCitations: 1

MegaAgent: A Large-Scale Autonomous LLM-based Multi-Agent System Without Predefined SOPs

2024-08-19Citations: 0