Automated Conjecture Resolution with Formal Verification

📅 2026-04-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the ambiguity inherent in natural language reasoning when tackling research-level mathematical problems, which often undermines the reliability of both solution generation and verification. To overcome this limitation, the authors propose a novel collaborative framework that integrates an informal reasoning agent, Rethlas, with a formal verification agent, Archon. Rethlas explores solution strategies, while Archon automatically translates them into machine-checkable proofs in Lean 4. This framework achieves, for the first time, end-to-end automated solving and formal verification of open mathematical conjectures by synergistically combining large language models, theorem retrieval systems (Matlas and LeanSearch), structured task decomposition, iterative refinement, and automated proof synthesis. The approach successfully resolves an open problem in commutative algebra and produces a formally verified proof in Lean 4 with minimal human intervention, establishing a new paradigm for the collaboration between informal and formal mathematical reasoning.
📝 Abstract
Recent advances in large language models have significantly improved their ability to perform mathematical reasoning, extending from elementary problem solving to increasingly capable performance on research-level problems. However, reliably solving and verifying such problems remains challenging due to the inherent ambiguity of natural language reasoning. In this paper, we propose an automated framework for tackling research-level mathematical problems that integrates natural language reasoning with formal verification, enabling end-to-end problem solving with minimal human intervention. Our framework consists of two components: an informal reasoning agent, Rethlas, and a formal verification agent, Archon. Rethlas mimics the workflow of human mathematicians by combining reasoning primitives with our theorem search engine, Matlas, to explore solution strategies and construct candidate proofs. Archon, equipped with our formal theorem search engine LeanSearch, translates informal arguments into formalized Lean 4 projects through structured task decomposition, iterative refinement, and automated proof synthesis, ensuring machine-checkable correctness. Using this framework, we automatically resolve an open problem in commutative algebra and formally verify the resulting proof in Lean 4 with essentially no human involvement. Our experiments demonstrate that strong theorem retrieval tools enable the discovery and application of cross-domain mathematical techniques, while the formal agent is capable of autonomously filling nontrivial gaps in informal arguments. More broadly, our work illustrates a promising paradigm for mathematical research in which informal and formal reasoning systems, equipped with theorem retrieval tools, operate in tandem to produce verifiable results, substantially reduce human effort, and offer a concrete instantiation of human-AI collaborative mathematical research.
Problem

Research questions and friction points this paper is trying to address.

automated conjecture resolution
formal verification
mathematical reasoning
natural language ambiguity
research-level mathematics
Innovation

Methods, ideas, or system contributions that make the work stand out.

automated conjecture resolution
formal verification
large language models
theorem retrieval
Lean 4
🔎 Similar Papers
No similar papers found.
H
Haocheng Ju
School of Mathematical Sciences, Peking University
G
Guoxiong Gao
School of Mathematical Sciences, Peking University
J
Jiedong Jiang
Westlake Institute for Advanced Study, Westlake University
B
Bin Wu
School of Mathematics, Tianjin University
Z
Zeming Sun
School of Mathematical Sciences, Peking University
L
Leheng Chen
School of Mathematical Sciences, Peking University
Y
Yutong Wang
School of Mathematical Sciences, Peking University
Yuefeng Wang
Yuefeng Wang
Northeastern University
Open-World Object Detection
Z
Zichen Wang
School of Mathematical Sciences, Peking University
W
Wanyi He
School of Mathematical Sciences, Peking University
P
Peihao Wu
IQuest Research
L
Liang Xiao
New Cornerstone Science Laboratory, School of Mathematical Sciences, Peking University
R
Ruochuan Liu
New Cornerstone Science Laboratory, School of Mathematical Sciences, Peking University
B
Bryan Dai
IQuest Research
Bin Dong
Bin Dong
Beijing International Center for Mathematical Research, Peking University
Scientific ComputingComputational ImagingMachine Learning