LeanTutor: Towards a Verified AI Mathematical Proof Tutor

📅 2026-01-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes a verifiable intelligent tutoring system for mathematical proofs that synergistically combines the strengths of large language models (LLMs) and formal proof assistants like Lean. Addressing the limitations of LLMs—prone to errors in mathematical reasoning—and the steep learning curve associated with formal provers, the system integrates three core components: automatic formalization, proof checking, next-step proof generation, and natural language feedback. It uniquely bridges the interactive flexibility of LLMs with the rigorous verifiability of theorem provers, achieving both user-friendliness and reliability. The authors implement a prototype system, LeanTutor, and introduce PeanoBench, a bilingual dataset comprising 371 proofs in Peano arithmetic, to evaluate their approach. This work establishes a new paradigm for verifiable AI-driven tutoring in formal mathematics.

Technology Category

Application Category

📝 Abstract
This paper considers the development of an AI-based provably-correct mathematical proof tutor. While Large Language Models (LLMs) allow seamless communication in natural language, they are error prone. Theorem provers such as Lean allow for provable-correctness, but these are hard for students to learn. We present a proof-of-concept system (LeanTutor) by combining the complementary strengths of LLMs and theorem provers. LeanTutor is composed of three modules: (i) an autoformalizer/proof-checker, (ii) a next-step generator, and (iii) a natural language feedback generator. To evaluate the system, we introduce PeanoBench, a dataset of 371 Peano Arithmetic proofs in human-written natural language and formal language, derived from the Natural Numbers Game.
Problem

Research questions and friction points this paper is trying to address.

AI tutoring
mathematical proof
formal verification
natural language interaction
correctness assurance
Innovation

Methods, ideas, or system contributions that make the work stand out.

verified AI
theorem proving
autoformalization
mathematical tutoring
PeanoBench
🔎 Similar Papers
No similar papers found.
M
Manooshree Patel
University of California, Berkeley
R
Rayna Bhattacharyya
University of California, Berkeley
T
Thomas Lu
University of California, Berkeley
A
Arnav Mehta
University of California, Berkeley
N
Niels Voss
University of California, Berkeley
Narges Norouzi
Narges Norouzi
EECS, University of California Berkeley
Machine LearningAI Education
G
G. Ranade
University of California, Berkeley