ExVerus: Verus Proof Repair via Counterexample Reasoning

πŸ“… 2026-03-26
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Current large language models treat proof generation in formal verification as a static end-to-end prediction task, which precludes the use of program execution feedback and thereby limits their proving capabilities. This work proposes the first large language model framework that integrates counterexample-guided reasoning: upon verification failure, the system automatically generates and validates concrete counterexamples, then leverages them to guide the model in generalizing inductive invariants for proof repair. By introducing dynamic, behavior-aware counterexample reasoning into large language model–driven formal verification, this approach significantly enhances the accuracy, robustness, and token efficiency of proof generation in Verus, outperforming state-of-the-art prompting strategies.
πŸ“ Abstract
Large Language Models (LLMs) have shown promising results in automating formal verification. However, existing approaches treat proof generation as a static, end-to-end prediction over source code, relying on limited verifier feedback and lacking access to concrete program behaviors. We present EXVERUS, a counterexample-guided framework that enables LLMs to reason about proofs using behavioral feedback via counterexamples. When a proof fails, EXVERUS automatically generates and validates counterexamples, and then guides the LLM to generalize them into inductive invariants to block these failures. Our evaluation shows that EXVERUS significantly improves proof accuracy, robustness, and token efficiency over the state-of-the-art prompting-based Verus proof generator.
Problem

Research questions and friction points this paper is trying to address.

formal verification
large language models
counterexample reasoning
proof repair
inductive invariants
Innovation

Methods, ideas, or system contributions that make the work stand out.

counterexample-guided reasoning
large language models
formal verification
inductive invariants
proof repair
πŸ”Ž Similar Papers
No similar papers found.
Jun Yang
Jun Yang
Professor, Electrical and Computer Engineering, University of Pittsburgh
Computer ArchitectureCompiler TechnologyElectronic Design Automation
Y
Yuechun Sun
Department of Computer Science, The University of Chicago
Y
Yi Wu
Department of Computer Science, The University of Chicago
R
Rodrigo Caridad
Department of Computer Science, The University of Chicago
Y
Yongwei Yuan
Department of Computer Science, Purdue University
J
Jianan Yao
Department of Electrical and Computer Engineering, The University of Toronto
Shan Lu
Shan Lu
Professor of Computer Science, University of Chicago
Computer SystemsSoftware ReliabilityProgram AnalysisConcurrency
Kexin Pei
Kexin Pei
Assistant Professor, Computer Science, University of Chicago
SecuritySoftware EngineeringMachine Learning