Enabling AI ASICs for Zero Knowledge Proof

📅 2026-04-20
📈 Citations: 0
Influential: 0
📄 PDF

career value

219K/year
🤖 AI Summary
Zero-knowledge proofs (ZKPs) suffer from limited efficiency due to the high computational cost of multi-scalar multiplication (MSM) and number-theoretic transforms (NTT). This work proposes MORPH, a framework that, for the first time, reformulates core ZKP computations to align with the execution model of AI ASICs such as TPUs. By introducing a hardware-aware Big-T complexity model, an MXU-centric extended RNS lazy reduction scheme, and a unified tiling layout strategy, MORPH eliminates carry chains and avoids on-chip data reshuffling. Further optimizations include low-precision GEMM conversion via JAX, an enhanced Pippenger algorithm for MSM, and a 3/5-step NTT design, all tailored to maximize utilization of TPU matrix units. Evaluated on TPUv6e8, the approach achieves up to a 10× increase in NTT throughput and delivers MSM performance on par with specialized ZKP accelerators like GZKP.

Technology Category

Application Category

📝 Abstract
Zero-knowledge proof (ZKP) provers remain costly because multi-scalar multiplication (MSM) and number-theoretic transforms (NTTs) dominate runtime as they need significant computation. AI ASICs such as TPUs provide massive matrix throughput and SotA energy efficiency. We present MORPH, the first framework that reformulates ZKP kernels to match AI-ASIC execution. We introduce Big-T complexity, a hardware-aware complexity model that exposes heterogeneous bottlenecks and layout-transformation costs ignored by Big-O. Guided by this analysis, (1) at arithmetic level, MORPH develops an MXU-centric extended-RNS lazy reduction that converts high-precision modular arithmetic into dense low-precision GEMMs, eliminating all carry chains, and (2) at dataflow level, MORPH constructs a unified-sharding layout-stationary TPU Pippenger MSM and optimized 3/5-step NTT that avoid on-TPU shuffles to minimize costly memory reorganization. Implemented in JAX, MORPH enables TPUv6e8 to achieve up-to 10x higher throughput on NTT and comparable throughput on MSM than GZKP. Our code: https://github.com/EfficientPPML/MORPH.
Problem

Research questions and friction points this paper is trying to address.

Zero-knowledge proof
Multi-scalar multiplication
Number-theoretic transform
AI ASICs
Computational cost
Innovation

Methods, ideas, or system contributions that make the work stand out.

Zero-Knowledge Proof
AI ASIC
Matrix Multiplication Unit (MXU)
Number-Theoretic Transform
Multi-Scalar Multiplication
🔎 Similar Papers
No similar papers found.