🤖 AI Summary
In local differential privacy (LDP) settings, existing methods for numerical distribution estimation often misallocate probability mass to locations far from true values. To address this, this paper introduces wavelet expansion into the LDP numerical distribution estimation framework—the first such approach. Our method hierarchically protects wavelet coefficients, prioritizing high-accuracy estimation of low-order (coarse-scale) coefficients to preserve the global shape of the underlying distribution and effectively suppress long-range probability misplacement. We theoretically establish consistency of the proposed estimator under both Wasserstein and Kolmogorov–Smirnov distances. Empirical evaluations demonstrate that our method achieves significantly higher estimation accuracy than state-of-the-art LDP techniques under both metrics. This work establishes a new paradigm for high-fidelity numerical distribution modeling under rigorous privacy constraints.
📝 Abstract
Distribution estimation under local differential privacy (LDP) is a fundamental and challenging task. Significant progresses have been made on categorical data. However, due to different evaluation metrics, these methods do not work well when transferred to numerical data. In particular, we need to prevent the probability mass from being misplaced far away. In this paper, we propose a new approach that express the sample distribution using wavelet expansions. The coefficients of wavelet series are estimated under LDP. Our method prioritizes the estimation of low-order coefficients, in order to ensure accurate estimation at macroscopic level. Therefore, the probability mass is prevented from being misplaced too far away from its ground truth. We establish theoretical guarantees for our methods. Experiments show that our wavelet expansion method significantly outperforms existing solutions under Wasserstein and KS distances.