🤖 AI Summary
This paper addresses individual-level privacy protection for aggregate queries in multi-user dynamic systems—where users frequently join, leave, and update their data. To this end, it extends Pufferfish privacy to such dynamic settings for the first time. The authors propose a Laplace noise calibration mechanism grounded in the Wasserstein metric (i.e., the first-order Kantorovich distance), and establish a multi-user sensitive-pair model alongside a statistical indistinguishability analysis framework. Theoretically, they prove that an individual’s privacy budget depends solely on their own data distribution, independent of others’. For Bernoulli variables, they introduce a relaxable noise condition and derive sufficient conditions for Pufferfish privacy under four canonical secret-pair families. Experiments demonstrate that the method substantially reduces required noise magnitude for binary variables, thereby improving query accuracy and data utility—while strictly preserving Pufferfish privacy.
📝 Abstract
This paper studies how to achieve individual indistinguishability by pufferfish privacy in aggregated query to a multi-user system. It is assumed that each user reports realization of a random variable. We study how to calibrate Laplace noise, added to the query answer, to attain pufferfish privacy when user changes his/her reported data value, leaves the system and is replaced by another use with different randomness. Sufficient conditions are derived for all scenarios for attaining statistical indistinguishability on four sets of secret pairs. They are derived using the existing Kantorovich method (Wasserstain metric of order $1$). These results can be applied to attain indistinguishability when a certain class of users is added or removed from a tabular data. It is revealed that attaining indifference in individual's data is conditioned on the statistics of this user only. For binary (Bernoulli distributed) random variables, the derived sufficient conditions can be further relaxed to reduce the noise and improve data utility.