FormalSpecCpp: A Dataset of C++ Formal Specifications created using LLMs

📅 2025-02-21

📈 Citations: 0

✨ Influential: 0

career value

169K/year

🤖 AI Summary

The C++ ecosystem lacks standardized benchmark datasets for empirically evaluating tools that infer or verify formal specifications. Method: FormalSpecCpp introduces the first open-source, C++-specific formal specification benchmark, comprising numerous programs annotated with rigorously defined preconditions and postconditions, fully compliant with ISO C++ syntax and contract-based programming idioms. Specifications are generated via large language models (LLMs), then manually verified and systematically annotated to ensure semantic fidelity and syntactic consistency. Contribution/Results: This dataset fills a critical gap in empirical research on formal methods for C++, enabling reproducible, scalable evaluation of specification inference tools, program verification algorithms, and LLMs’ capabilities in formal software development—including fine-tuning, generalization, and specification synthesis. By providing a rigorous, community-accessible standard, FormalSpecCpp significantly enhances methodological rigor and cross-study comparability in this domain.

Technology Category

Application Category

📝 Abstract

FormalSpecCpp is a dataset designed to fill the gap in standardized benchmarks for verifying formal specifications in C++ programs. To the best of our knowledge, this is the first comprehensive collection of C++ programs with well-defined preconditions and postconditions. It provides a structured benchmark for evaluating specification inference tools and testing theaccuracy of generated specifications. Researchers and developers can use this dataset to benchmark specification inference tools,fine-tune Large Language Models (LLMs) for automated specification generation, and analyze the role of formal specifications in improving program verification and automated testing. By making this dataset publicly available, we aim to advance research in program verification, specification inference, and AI-assisted software development. The dataset and the code are available at https://github.com/MadhuNimmo/FormalSpecCpp.

Problem

Research questions and friction points this paper is trying to address.

Filling the gap in C++ formal specification benchmarks

Providing structured benchmark for specification inference tools

Advancing research in AI-assisted software development

Innovation

Methods, ideas, or system contributions that make the work stand out.

LLMs generate C++ formal specifications

First comprehensive C++ precondition/postcondition dataset

Benchmark for specification inference tools

🔎 Similar Papers

SpecGen: Automated Generation of Formal Program Specifications via Large Language Models

2024-01-16arXiv.orgCitations: 9

An Empirical Evaluation of Pre-trained Large Language Models for Repairing Declarative Formal Specifications

2024-04-17arXiv.orgCitations: 6