Consistent Estimation of Numerical Distributions under Local Differential Privacy by Wavelet Expansion

📅 2025-09-23

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

In local differential privacy (LDP) settings, existing methods for numerical distribution estimation often misallocate probability mass to locations far from true values. To address this, this paper introduces wavelet expansion into the LDP numerical distribution estimation framework—the first such approach. Our method hierarchically protects wavelet coefficients, prioritizing high-accuracy estimation of low-order (coarse-scale) coefficients to preserve the global shape of the underlying distribution and effectively suppress long-range probability misplacement. We theoretically establish consistency of the proposed estimator under both Wasserstein and Kolmogorov–Smirnov distances. Empirical evaluations demonstrate that our method achieves significantly higher estimation accuracy than state-of-the-art LDP techniques under both metrics. This work establishes a new paradigm for high-fidelity numerical distribution modeling under rigorous privacy constraints.

Technology Category

Application Category

📝 Abstract

Distribution estimation under local differential privacy (LDP) is a fundamental and challenging task. Significant progresses have been made on categorical data. However, due to different evaluation metrics, these methods do not work well when transferred to numerical data. In particular, we need to prevent the probability mass from being misplaced far away. In this paper, we propose a new approach that express the sample distribution using wavelet expansions. The coefficients of wavelet series are estimated under LDP. Our method prioritizes the estimation of low-order coefficients, in order to ensure accurate estimation at macroscopic level. Therefore, the probability mass is prevented from being misplaced too far away from its ground truth. We establish theoretical guarantees for our methods. Experiments show that our wavelet expansion method significantly outperforms existing solutions under Wasserstein and KS distances.

Problem

Research questions and friction points this paper is trying to address.

Estimating numerical distributions accurately under local differential privacy constraints

Preventing probability mass misplacement far from ground truth in LDP settings

Overcoming limitations of categorical data methods for numerical data estimation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Wavelet expansion for distribution representation

Prioritizing low-order coefficient estimation

Local differential privacy with theoretical guarantees

🔎 Similar Papers

No similar papers found.

Authors to Follow