🤖 AI Summary
This paper investigates the statistical limits of exact community recovery in $h$-uniform hypergraphs under hyperedge-level privacy constraints. Building upon the $h$-uniform Hypergraph Stochastic Block Model ($h$-HSBM), we introduce the first formal definition of $(varepsilon,delta)$-hyperedge differential privacy, a unified generalization of edge-level DP. Methodologically, we rigorously characterize the threshold shrinkage for exact recovery under three privatization mechanisms—stability mechanism, Bayesian sampling, and randomized response—and derive closed-form expressions for their respective recovery thresholds. We prove that the latter two mechanisms achieve pure $varepsilon$-DP. Key results show that threshold degradation scales logarithmically with the intra-to-inter-cluster density ratio and linearly with hypergraph size, both governed by the privacy budget $varepsilon$. Our work establishes fundamental theoretical benchmarks and principled design guidelines for privacy-preserving community detection in higher-order relational data.
📝 Abstract
This paper studies the exact recovery threshold subject to preserving the privacy of connections in $h$-uniform hypergraphs. Privacy is characterized by the $(ε, δ)$-hyperedge differential privacy (DP), an extension of the notion of $(ε, δ)$-edge DP in the literature. The hypergraph observations are modeled through a $h$-uniform stochastic block model ($h$-HSBM) in the dense regime. We investigate three differentially private mechanisms: stability-based, sampling-based, and perturbation-based mechanisms. We calculate the exact recovery threshold for each mechanism and study the contraction of the exact recovery region due to the privacy budget, $(ε, δ)$. Sampling-based mechanisms and randomized response mechanisms guarantee pure $ε$-hyperedge DP where $δ=0$, while the stability-based mechanisms cannot achieve this level of privacy. The dependence of the limits of the privacy budget on the parameters of the $h$-uniform hypergraph is studied. More precisely, it is proven rigorously that the minimum privacy budget scales logarithmically with the ratio between the density of in-cluster hyperedges and the cross-cluster hyperedges for stability-based and Bayesian sampling-based mechanisms, while this budget depends only on the size of the hypergraph for the randomized response mechanism.