AI Agents Can Already Autonomously Perform Experimental High Energy Physics

๐Ÿ“… 2026-03-20
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work proposes the Just Furnish Context (JFC) framework, a novel approach that eliminates the need for manual scaffolding in high-energy physics analyses, which have traditionally relied on labor-intensive, error-prone human coding. By providing only a dataset, execution environment, and relevant literature corpus, an autonomous agent powered by Claude Codeโ€”a large language modelโ€”can perform end-to-end analysis, including event selection, background estimation, uncertainty quantification, statistical inference, and scientific paper drafting. The system integrates literature-aware retrieval with a multi-agent review mechanism to ensure rigor and reproducibility. Validated on public datasets from ALEPH, DELPHI, and CMS experiments, JFC successfully reproduces established measurements in electroweak physics, QCD, and Higgs boson studies, generating credible, publication-ready reports while substantially reducing human effort.

Technology Category

Application Category

๐Ÿ“ Abstract
Large language model-based AI agents are now able to autonomously execute substantial portions of a high energy physics (HEP) analysis pipeline with minimal expert-curated input. Given access to a HEP dataset, an execution framework, and a corpus of prior experimental literature, we find that Claude Code succeeds in automating all stages of a typical analysis: event selection, background estimation, uncertainty quantification, statistical inference, and paper drafting. We argue that the experimental HEP community is underestimating the current capabilities of these systems, and that most proposed agentic workflows are too narrowly scoped or scaffolded to specific analysis structures. We present a proof-of-concept framework, Just Furnish Context (JFC), that integrates autonomous analysis agents with literature-based knowledge retrieval and multi-agent review, and show that this is sufficient to plan, execute, and document a credible high energy physics analysis. We demonstrate this by conducting analyses on open data from ALEPH, DELPHI, and CMS to perform electroweak, QCD, and Higgs boson measurements. Rather than replacing physicists, these tools promise to offload the repetitive technical burden of analysis code development, freeing researchers to focus on physics insight, truly novel method development, and rigorous validation. Given these developments, we advocate for new strategies for how the community trains students, organizes analysis efforts, and allocates human expertise.
Problem

Research questions and friction points this paper is trying to address.

AI agents
high energy physics
autonomous analysis
analysis automation
large language models
Innovation

Methods, ideas, or system contributions that make the work stand out.

AI agents
autonomous analysis
high energy physics
large language models
Just Furnish Context
๐Ÿ”Ž Similar Papers
2024-09-10arXiv.orgCitations: 3
2024-08-28Journal of Chemical Information and ModelingCitations: 1
Eric A. Moreno
Eric A. Moreno
Massachusetts Institute of Technology, CERN, LIGO, Fermilab
Machine LearningHigh Energy PhysicsParticle PhysicsGravitational Waves
S
Samuel Bright-Thonney
Department of Physics, Massachusetts Institute of Technology; NSF AI Institute for Artificial Intelligence and Fundamental Interactions
A
Andrzej Novak
Department of Physics, Massachusetts Institute of Technology; NSF AI Institute for Artificial Intelligence and Fundamental Interactions
D
Dolores Garcia
CERN
Philip Harris
Philip Harris
MIT
Machine LearningDark MatterHiggs bosonGravitational WavesFPGAs