Sal: Multi-modal Verification of Replicated Data Types

📅 2026-03-28

📈 Citations: 0

✨ Influential: 0

career value

149K/year

🤖 AI Summary

This work addresses the challenge of ensuring semantic correctness in replicated data types (RDTs) under independent evolution and merging by introducing Sal, a Lean-based multimodal verification workflow for the formal design and verification of state-based CRDTs and MRDTs. Sal innovatively integrates kernel-verifiable automation, SMT-assisted proving, and AI-enhanced interactive theorem proving, complemented by QuickCheck-style property testing to automatically generate and visualize counterexamples. This approach substantially reduces the trusted computing base and enhances verification transparency. In experiments, Sal successfully verified 13 RDTs, with 69% of verification conditions discharged automatically by the kernel without SMT assistance, and effectively uncovered subtle bugs such as the enable-wins flag anomaly.

Technology Category

Application Category

📝 Abstract

Designing correct replicated data types (RDTs) is challenging because replicas evolve independently and must be merged while preserving application intent. A promising approach is correct-by-construction development in a proof-oriented programming language such as F*, Dafny and Lean, where desired correctness guarantees are specified and checked as the RDTs are implemented. Recent work Neem proposes the use of replication-aware linearizability (RA linearizability) as the correctness condition for state-based CRDTs and mergeable replicated data types (MRDTs), with automation in the SMT-aided, proof-oriented programming language F*. However, SMT-centric workflows can be opaque when automation fails to discharge a verification condition (VC), and they enlarge the trusted computing base (TCB). We present Sal, a multi-modal workflow to design and verify state-based CRDTs and MRDTs in Lean. Sal combines (i) kernel-checkable automation with proof reconstruction, (ii) SMT-aided automation when needed, and (iii) AI-assisted interactive theorem proving for remaining proof obligations. When a verification condition is shown to be invalid, we leverage Lean's property-based testing to automatically generate and visualize counterexamples, helping developers debug incorrect specifications or implementations. We report on our experience verifying a suite of 13 CRDTs and MRDTs with Sal: 69% of verification conditions are discharged by kernel-verified automation without SMT, and counterexamples automatically expose subtle bugs such as the well-known enable-wins flag anomaly. The codebase for Sal is open-sourced, and is available at \href{https://github.com/fplaunchpad/sal}{https://github.com/fplaunchpad/sal}

Problem

Research questions and friction points this paper is trying to address.

replicated data types

correctness verification

CRDTs

MRDTs

formal verification

Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-modal verification

replicated data types

proof reconstruction