Polynomial Multiproofs for Scalable Data Availability Sampling in Blockchain Light Clients

📅 2026-04-17
📈 Citations: 0
Influential: 0
📄 PDF

career value

230K/year
🤖 AI Summary
This work addresses the high bandwidth, storage, and verification overheads incurred by blockchain light clients in data availability sampling, where each sampled unit traditionally requires an individual KZG proof. The paper introduces, for the first time, a Polynomial Multi-Proof (PMP) mechanism to this setting, aggregating proofs for multiple sampled units into a single proof over a shared micro-domain. It further redesigns the proof generation, dissemination, retrieval, and verification pipeline within the P2P light client stack. Experimental evaluation on Avail’s modular data availability layer demonstrates that the proposed approach substantially reduces proof size, validator CPU and memory consumption, and infrastructure costs—by up to 45% compared to per-unit baselines—while also uncovering a critical trade-off between aggregation efficiency and security guarantees.

Technology Category

Application Category

📝 Abstract
Light clients are essential for scalable blockchain systems because they verify data availability without downloading full blocks. In data availability sampling based systems, sampled cells are retrieved from a peer-to-peer network and verified against cryptographic commitments. A common deployment pattern associates each sampled cell with an independent Kate-Zaverucha-Goldberg (KZG) proof, creating substantial cumulative bandwidth, storage, and verification overhead. This paper studies polynomial multiproofs (PMP) as a mechanism for reducing these costs in blockchain light clients. We present a design in which multiple sampled cell evaluations are verified using a single aggregated proof over a shared evaluation micro-domain and describe the corresponding changes to proof generation, dissemination, retrieval, and verification in a peer-to-peer light-client stack. We instantiate and evaluate the design in Avail, a modular data availability layer for blockchains, as a case study. The results show lower proof bytes, lower verifier CPU and memory usage, and deployment-level infrastructure cost reductions of up to 45% relative to a per-cell baseline, while also clarifying the trade-offs introduced by grouped retrieval.
Problem

Research questions and friction points this paper is trying to address.

data availability sampling
light clients
KZG proofs
blockchain scalability
proof overhead
Innovation

Methods, ideas, or system contributions that make the work stand out.

Polynomial Multiproofs
Data Availability Sampling
Light Clients
KZG Commitments
Proof Aggregation