Towards Language Model Guided TLA+ Proof Automation

📅 2025-12-10

📈 Citations: 0

✨ Influential: 0

career value

159K/year

🤖 AI Summary

TLA+ formal verification faces challenges including high manual proof effort, strict syntax constraints, and complex hierarchical structure, hindering automation. To address these, this paper proposes an LLM-driven, hierarchically guided proof automation framework. Our method constrains the LLM to generate only standardized subgoal decompositions—rather than full proofs—to avoid TLA+ syntax errors; introduces a hierarchical proposition decomposition mechanism integrated with an LLM-Symbolic Prover co-architecture; and constructs, for the first time, a dual-source benchmark suite comprising 119 theorems drawn from distributed protocols and mathematical induction. Experimental results demonstrate that our approach significantly outperforms all baselines across all theorems, achieving substantial improvements in subgoal decomposition accuracy and verification success rate, while markedly reducing error rates.

Technology Category

Application Category

📝 Abstract

Formal theorem proving with TLA+ provides rigorous guarantees for system specifications, but constructing proofs requires substantial expertise and effort. While large language models have shown promise in automating proofs for tactic-based theorem provers like Lean, applying these approaches directly to TLA+ faces significant challenges due to the unique hierarchical proof structure of the TLA+ proof system. We present a prompt-based approach that leverages LLMs to guide hierarchical decomposition of complex proof obligations into simpler sub-claims, while relying on symbolic provers for verification. Our key insight is to constrain LLMs to generate normalized claim decompositions rather than complete proofs, significantly reducing syntax errors. We also introduce a benchmark suite of 119 theorems adapted from (1) established mathematical collections and (2) inductive proofs of distributed protocols. Our approach consistently outperforms baseline methods across the benchmark suite.

Problem

Research questions and friction points this paper is trying to address.

Automates TLA+ proof construction using LLMs

Handles hierarchical proof structures via decomposition

Reduces syntax errors with normalized claim generation

Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-guided hierarchical decomposition of proof obligations

Constrained generation of normalized claim decompositions

Integration of symbolic provers for verification

🔎 Similar Papers

No similar papers found.