Replicable Distribution Testing

📅 2025-07-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work systematically investigates the sample complexity of discrete distribution testing under the algorithmic reproducibility framework, focusing on reproducible testing of fundamental properties—closeness, independence, and uniformity. Methodologically, it introduces the first theoretical framework for reproducible distribution testing, unifying probabilistic analysis, reproducible algorithm design, and information-theoretic lower-bound techniques. It establishes the first general lower-bound proof methodology for reproducible testing and resolves, for the first time, the long-standing open problem of deriving a tight reproducibility lower bound for uniformity testing. Furthermore, it constructs reproducible algorithms for closeness and independence testing, and achieves nearly optimal sample complexity bounds for both uniformity and closeness testing. These results significantly advance the theoretical foundations of reproducible statistical learning.

Technology Category

Application Category

📝 Abstract
We initiate a systematic investigation of distribution testing in the framework of algorithmic replicability. Specifically, given independent samples from a collection of probability distributions, the goal is to characterize the sample complexity of replicably testing natural properties of the underlying distributions. On the algorithmic front, we develop new replicable algorithms for testing closeness and independence of discrete distributions. On the lower bound front, we develop a new methodology for proving sample complexity lower bounds for replicable testing that may be of broader interest. As an application of our technique, we establish near-optimal sample complexity lower bounds for replicable uniformity testing -- answering an open question from prior work -- and closeness testing.
Problem

Research questions and friction points this paper is trying to address.

Characterize sample complexity for replicable distribution testing
Develop replicable algorithms for testing closeness and independence
Establish lower bounds for replicable uniformity and closeness testing
Innovation

Methods, ideas, or system contributions that make the work stand out.

Replicable algorithms for distribution testing
New methodology for lower bounds
Near-optimal sample complexity bounds
🔎 Similar Papers
No similar papers found.