🤖 AI Summary
Distribution verification of samplers over infinite domains (e.g., the natural numbers) has long been hindered by worst-case computational complexity, rendering existing methods impractical. This paper introduces the first instance-dependent, efficient testing framework, built on an interval-conditioning mechanism that estimates the distance between an unknown and a known distribution by reducing the problem to probability mass estimation for a continuous distribution—thereby decoupling test efficiency from worst-case guarantees. The method supports rigorous statistical verification of samplers over countably infinite domains while remaining computationally tractable. Empirical evaluation demonstrates up to a 1000× speedup over state-of-the-art approaches across diverse distribution families, significantly improving the practicality, scalability, and deployability of sampler validation.
📝 Abstract
Sampling algorithms play a pivotal role in probabilistic AI. However, verifying whether a sampler program indeed samples from the claimed distribution is a notoriously hard problem. Provably correct testers such as Barbarik, Teq, Flash, and CubeProbe, each targeting a different kind of sampler, were proposed only in the last few years. All these testers focus on worst-case efficiency and do not support verification of samplers over infinite domains, a setting that arises frequently in astronomy, finance, network security, and other fields.
In this work, we design the first tester of samplers with instance-dependent efficiency, allowing us to test samplers over the natural numbers. Our tests are built on a novel algorithm for estimating the distance between an unknown and a known probability distribution using an interval-conditioning framework. The core technical contribution is a new connection to probability mass estimation for a continuous distribution. The practical gains are also substantial: our experiments demonstrate up to a 1000× speedup over state-of-the-art testers.
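To make the interval-conditioning idea concrete, here is a minimal, hypothetical sketch (all names and structure are illustrative assumptions, not the paper's actual algorithm): condition samples from a black-box sampler to a finite interval of the naturals, then compare the empirical conditional frequencies against the known distribution's conditional probabilities on that interval.

```python
import random
from math import exp

def known_pmf(k, lam=1.0):
    # Known reference distribution over the naturals
    # (a geometric-like law; illustrative choice).
    return (1 - exp(-lam)) * exp(-lam * k)

def unknown_sampler():
    # Stand-in for the black-box sampler under test. Here it samples
    # from the same law via inverse-transform sampling, so the
    # estimated distance should be close to zero.
    u = random.random()
    k, acc = 0, known_pmf(0)
    while u > acc:
        k += 1
        acc += known_pmf(k)
    return k

def conditioned_tv_estimate(sampler, pmf, lo, hi, n_samples=20000):
    # Interval conditioning: keep only samples that land in [lo, hi],
    # then estimate the total variation distance between the empirical
    # conditional distribution and the known conditional distribution.
    hits = [0] * (hi - lo + 1)
    kept = 0
    for _ in range(n_samples):
        x = sampler()
        if lo <= x <= hi:
            hits[x - lo] += 1
            kept += 1
    if kept == 0:
        return None  # interval mass too small to estimate
    mass = sum(pmf(k) for k in range(lo, hi + 1))
    return 0.5 * sum(abs(hits[k - lo] / kept - pmf(k) / mass)
                     for k in range(lo, hi + 1))

random.seed(0)
est = conditioned_tv_estimate(unknown_sampler, known_pmf, 0, 10)
print(est)
```

Because the sampler under test matches the reference here, the estimate is small; a faulty sampler would inflate it on intervals where the two laws disagree. The paper's actual tester additionally provides instance-dependent sample-complexity guarantees, which this sketch does not capture.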