🤖 AI Summary
This paper addresses the computation of the Wasserstein barycenter under ε-differential privacy constraints. The Wasserstein barycenter, defined as the Fréchet mean of a set of probability distributions with respect to the Wasserstein metric, plays a key role in machine learning and statistical analysis; however, its input empirical distributions are often derived from sensitive data, necessitating rigorous privacy protection. The authors propose the first ε-differentially private framework for Wasserstein barycenter computation: it injects noise into the empirical distributions, guided by optimal transport geometry, and refines the result via synthetic-data optimization. Evaluated on real-world datasets, including MNIST and U.S. Census data, the method efficiently computes private barycenters while providing strict theoretical privacy guarantees. It substantially outperforms existing baselines, achieving a state-of-the-art privacy–accuracy trade-off. This work establishes a novel interdisciplinary bridge between differential privacy and optimal transport theory.
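To make the pipeline concrete, here is a minimal sketch of the two-stage idea described above, under simplifying assumptions of my own: discrete distributions on a shared 1-D grid, plain per-bin Laplace noise standing in for the paper's geometry-aware noise injection, and the closed-form 1-D barycenter (averaging quantile functions) standing in for general barycenter optimization. All function names here are hypothetical, not the paper's API.

```python
import numpy as np

rng = np.random.default_rng(0)

def privatize_histogram(counts, epsilon, rng):
    """Laplace mechanism on bin counts. Assumes add/remove-one-record
    neighboring datasets, so per-bin L1 sensitivity is 1 and scale
    1/epsilon yields epsilon-DP. This is a simplification: the paper
    injects noise guided by optimal transport geometry instead."""
    noisy = counts + rng.laplace(scale=1.0 / epsilon, size=counts.shape)
    noisy = np.clip(noisy, 0.0, None) + 1e-12  # keep bins positive
    return noisy / noisy.sum()                  # renormalize to a distribution

def barycenter_1d(dists, support, n_quantiles=200):
    """1-D Wasserstein barycenter via the known closed form:
    average the quantile (inverse-CDF) functions of the inputs."""
    qs = np.linspace(0.01, 0.99, n_quantiles)
    inv_cdfs = [np.interp(qs, np.cumsum(p), support) for p in dists]
    return np.mean(inv_cdfs, axis=0)  # barycenter's quantile function

# Toy usage: two sensitive samples -> private histograms -> barycenter.
support = np.linspace(0.0, 1.0, 50)
counts1 = np.histogram(rng.normal(0.3, 0.05, 1000), bins=50, range=(0, 1))[0].astype(float)
counts2 = np.histogram(rng.normal(0.7, 0.05, 1000), bins=50, range=(0, 1))[0].astype(float)
priv1 = privatize_histogram(counts1, epsilon=1.0, rng=rng)
priv2 = privatize_histogram(counts2, epsilon=1.0, rng=rng)
bary = barycenter_1d([priv1, priv2], support)  # mass concentrates near 0.5
```

The sketch omits the paper's second stage (refinement via synthetic-data optimization); it only illustrates why privatizing the inputs, rather than the barycenter itself, lets the downstream barycenter computation run unchanged by post-processing.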
📝 Abstract
The Wasserstein barycenter is defined as the mean of a set of probability measures under the optimal transport metric, and has numerous applications spanning machine learning, statistics, and computer graphics. In practice these input measures are empirical distributions built from sensitive datasets, motivating a differentially private (DP) treatment. We present, to our knowledge, the first algorithms for computing Wasserstein barycenters under differential privacy. Empirically, on synthetic data, MNIST, and large-scale U.S. population datasets, our methods produce high-quality private barycenters with strong accuracy-privacy tradeoffs.