${ m P{small ROOF}W{small ALA}}$: Multilingual Proof Data Synthesis and Theorem-Proving

📅 2025-02-07

📈 Citations: 0

✨ Influential: 0

career value

164K/year

🤖 AI Summary

Existing neural theorem proving models rely heavily on a single interactive theorem prover (ITP), hindering cross-system transferability and limiting generalization and practical applicability. Method: We propose the first multi-ITP theorem proving framework supporting both Coq and Lean. It introduces a unified interaction protocol that models proof actions and state transitions as cross-language alignable, standardized representations; designs multi-ITP interfaces, parallelized search algorithms, and a cross-system proof-step synthesis mechanism; and enables joint multilingual training. Contribution/Results: Experiments demonstrate significant improvements over single-ITP baselines under the prove-at-k metric, providing the first empirical validation of effective knowledge transfer for neural theorem proving across heterogeneous ITPs. The complete codebase and framework are publicly released.

Technology Category

Application Category

📝 Abstract

Neural networks have shown substantial promise at automatic theorem-proving in interactive proof assistants (ITPs) like Lean and Coq. However, most neural theorem-proving models are restricted to specific ITPs, leaving out opportunities for cross-lingual $ extit{transfer}$ between ITPs. We address this weakness with a multilingual proof framework, ${ m P{small ROOF}W{small ALA}}$, that allows a standardized form of interaction between neural theorem-provers and two established ITPs (Coq and Lean). It enables the collection of multilingual proof step data -- data recording the result of proof actions on ITP states -- for training neural provers. ${ m P{small ROOF}W{small ALA}}$ allows the systematic evaluation of a model's performance across different ITPs and problem domains via efficient parallel proof search algorithms. We show that multilingual training enabled by ${ m P{small ROOF}W{small ALA}}$ can lead to successful transfer across ITPs. Specifically, a model trained on a mix of ${ m P{small ROOF}W{small ALA}}$-generated Coq and Lean data outperforms Lean-only and Coq-only models on the standard prove-at-$k$ metric. We open source all code including code for the $href{https://github.com/trishullab/proof-wala}{ProofWala; Framework}$, and the $href{https://github.com/trishullab/itp-interface}{Multilingual; ITP; interaction; framework}$.

Problem

Research questions and friction points this paper is trying to address.

Multilingual proof data synthesis

Cross-lingual transfer in theorem-proving

Neural theorem-proving performance evaluation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Multilingual proof framework for ITPs

Standardized interaction between neural provers

Efficient parallel proof search algorithms

🔎 Similar Papers

RoMath: A Mathematical Reasoning Benchmark in Romanian

2024-09-17arXiv.orgCitations: 0