🤖 AI Summary
Existing large language models (LLMs) lack end-to-end capability in abstract, mathematically intensive open-science tasks—particularly in theoretical and computational physics—where rigorous reasoning, symbolic manipulation, and numerical computation must be tightly integrated.
Method: This paper introduces the “AI Physicist” framework: (i) a novel LLM-based agent architecture unifying abstract physical reasoning with symbolic-numerical hybrid computation; (ii) LANDAU—a hierarchical academic knowledge universe designed to enhance decision reliability through structured domain knowledge; and (iii) a long-horizon scientific闭环 integrating methodology-trajectory validation with adaptive reinforcement-driven exploration.
Contribution/Results: The framework achieves three breakthroughs across high-energy physics, condensed-matter physics, and astrophysics: (i) reduction of research cycles from months to hours; (ii) fully automated, hypothesis-driven closed-loop verification; and (iii) autonomous discovery and modeling of open scientific problems—marking a significant advance toward AI-augmented scientific discovery.
📝 Abstract
Advances in LLMs have produced agents with knowledge and operational capabilities comparable to human scientists, suggesting potential to assist, accelerate, and automate research. However, existing studies mainly evaluate such systems on well-defined benchmarks or general tasks like literature retrieval, limiting their end-to-end problem-solving ability in open scientific scenarios. This is particularly true in physics, which is abstract, mathematically intensive, and requires integrating analytical reasoning with code-based computation. To address this, we propose PhysMaster, an LLM-based agent functioning as an autonomous theoretical and computational physicist. PhysMaster couples absract reasoning with numerical computation and leverages LANDAU, the Layered Academic Data Universe, which preserves retrieved literature, curated prior knowledge, and validated methodological traces, enhancing decision reliability and stability. It also employs an adaptive exploration strategy balancing efficiency and open-ended exploration, enabling robust performance in ultra-long-horizon tasks. We evaluate PhysMaster on problems from high-energy theory, condensed matter theory to astrophysics, including: (i) acceleration, compressing labor-intensive research from months to hours; (ii) automation, autonomously executing hypothesis-driven loops ; and (iii) autonomous discovery, independently exploring open problems.