Consistent Estimation of Numerical Distributions under Local Differential Privacy by Wavelet Expansion

📅 2025-09-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In local differential privacy (LDP) settings, existing methods for numerical distribution estimation often misallocate probability mass to locations far from true values. To address this, this paper introduces wavelet expansion into the LDP numerical distribution estimation framework—the first such approach. Our method hierarchically protects wavelet coefficients, prioritizing high-accuracy estimation of low-order (coarse-scale) coefficients to preserve the global shape of the underlying distribution and effectively suppress long-range probability misplacement. We theoretically establish consistency of the proposed estimator under both Wasserstein and Kolmogorov–Smirnov distances. Empirical evaluations demonstrate that our method achieves significantly higher estimation accuracy than state-of-the-art LDP techniques under both metrics. This work establishes a new paradigm for high-fidelity numerical distribution modeling under rigorous privacy constraints.

Technology Category

Application Category

📝 Abstract
Distribution estimation under local differential privacy (LDP) is a fundamental and challenging task. Significant progresses have been made on categorical data. However, due to different evaluation metrics, these methods do not work well when transferred to numerical data. In particular, we need to prevent the probability mass from being misplaced far away. In this paper, we propose a new approach that express the sample distribution using wavelet expansions. The coefficients of wavelet series are estimated under LDP. Our method prioritizes the estimation of low-order coefficients, in order to ensure accurate estimation at macroscopic level. Therefore, the probability mass is prevented from being misplaced too far away from its ground truth. We establish theoretical guarantees for our methods. Experiments show that our wavelet expansion method significantly outperforms existing solutions under Wasserstein and KS distances.
Problem

Research questions and friction points this paper is trying to address.

Estimating numerical distributions accurately under local differential privacy constraints
Preventing probability mass misplacement far from ground truth in LDP settings
Overcoming limitations of categorical data methods for numerical data estimation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Wavelet expansion for distribution representation
Prioritizing low-order coefficient estimation
Local differential privacy with theoretical guarantees
🔎 Similar Papers
No similar papers found.
P
Puning Zhao
Shenzhen Campus of Sun Yat-sen University, Guangdong Key Laboratory of Information Security Technology
Zhikun Zhang
Zhikun Zhang
Assistant Professor, Zhejiang University
Trustworthy AIData PrivacyDifferential Privacy
B
Bo Sun
Zhejiang University
L
Li Shen
Shenzhen Campus of Sun Yat-sen University
L
Liang Zhang
Shenzhen Campus of Sun Yat-sen University
S
Shaowei Wang
Guangzhou University
Z
Zhe Liu
Zhejiang University