QMBench: A Research Level Benchmark for Quantum Materials Research

📅 2025-12-18
📈 Citations: 0
Influential: 0
🤖 AI Summary
Problem: It is not yet established how well large language model (LLM)-based agents can apply condensed matter physics knowledge and computational techniques, such as density functional theory (DFT), to quantum materials research. Method: The paper introduces QMBench, a comprehensive, domain-specific benchmark for this purpose. It covers five areas (structural properties, electronic properties, thermodynamic and other properties, symmetry principles, and computational methodologies) and provides a standardized evaluation framework for LLM agents. Contribution/Results: QMBench aims to accelerate the development of an AI scientist capable of making creative contributions to quantum materials research. The authors expect the benchmark to be developed and continually improved by the research community.

📝 Abstract
We introduce QMBench, a comprehensive benchmark designed to evaluate the capability of large language model agents in quantum materials research. This specialized benchmark assesses a model's ability to apply condensed matter physics knowledge and computational techniques, such as density functional theory, to solve research problems in quantum materials science. QMBench spans several domains of quantum materials research, including structural properties, electronic properties, thermodynamic and other properties, symmetry principles, and computational methodologies. By providing a standardized evaluation framework, QMBench aims to accelerate the development of an AI scientist capable of making creative contributions to quantum materials research. We expect QMBench to be developed and continually improved by the research community.
Problem

Research questions and friction points this paper is trying to address.

Evaluates AI's capability in quantum materials research
Assesses application of physics knowledge and computational techniques
Provides standardized framework to accelerate AI scientist development
Innovation

Methods, ideas, or system contributions that make the work stand out.

Benchmark evaluates AI agents in quantum materials
Assesses application of physics and computational techniques
Standardized framework accelerates AI scientist development
Yanzhen Wang
Department of Materials Science and Engineering, Stanford University, Stanford, CA 94305, USA
Yiyang Jiang
PhD student, Hong Kong Polytechnic University
Machine Learning, Computer Vision, Vision-Language Understanding, Natural Language Processing
Diana Golovanova
Department of Condensed Matter Physics, Weizmann Institute of Science, Rehovot 7610001, Israel
Kamal Das
Department of Condensed Matter Physics, Weizmann Institute of Science, Rehovot 7610001, Israel
Hyeonhu Bae
Department of Condensed Matter Physics, Weizmann Institute of Science, Rehovot 7610001, Israel
Yufei Zhao
Department of Condensed Matter Physics, Weizmann Institute of Science, Rehovot 7610001, Israel
Huu-Thong Le
Department of Physics, Pennsylvania State University, University Park, PA 16802, USA
Abhinava Chatterjee
Department of Physics, Pennsylvania State University, University Park, PA 16802, USA
Yunzhe Liu
Department of Physics, Pennsylvania State University, University Park, PA 16802, USA
Chao-Xing Liu
Department of Physics, Pennsylvania State University, University Park, PA 16802, USA
Felipe H. da Jornada
Department of Materials Science and Engineering, Stanford University, Stanford, CA 94305, USA
Binghai Yan
Department of Physics, Pennsylvania State University, University Park, PA 16802, USA
Xiao-Liang Qi
Path Integral Technology, Inc., Belmont, CA 94002, USA