🤖 AI Summary
This work addresses the challenge of verifying the correctness of reconstructed samples in gradient inversion attacks within federated learning, particularly the lack of reliable evaluation metrics for tabular data. The authors propose a verifiable gradient inversion attack that constructs a geometric hyperplane model based on ReLU activation boundaries and employs a subspace algebraic verification mechanism to determine whether the input space contains only a single record. Leveraging this guarantee, the method analytically recovers feature vectors followed by lightweight optimization. It provides the first explicit correctness certificate for gradient inversion attacks, overcoming the limitations of existing approaches that rely on manual inspection or lack quantifiable fidelity measures. Experiments demonstrate that the method achieves accurate and verifiable reconstruction of original records on large-batch tabular data, significantly outperforming state-of-the-art attacks—especially in scenarios where prior methods fail or produce unverifiable results.
📝 Abstract
Gradient inversion attacks threaten client privacy in federated learning by reconstructing training samples from clients' shared gradients. Gradients aggregate contributions from multiple records and existing attacks may fail to disentangle them, yielding incorrect reconstructions with no intrinsic way to certify success. In vision and language, attackers may fall back on human inspection to judge reconstruction plausibility, but this is far less feasible for numerical tabular records, fueling the impression that tabular data is less vulnerable.
We challenge this perception by proposing a verifiable gradient inversion attack (VGIA) that provides an explicit certificate of correctness for reconstructed samples. Our method adopts a geometric view of ReLU leakage: the activation boundary of a fully connected layer defines a hyperplane in input space. VGIA introduces an algebraic, subspace-based verification test that detects when a hyperplane-delimited region contains exactly one record. Once isolation is certified, VGIA recovers the corresponding feature vector analytically and reconstructs the target via a lightweight optimization step.
Experiments on tabular benchmarks with large batch sizes demonstrate exact record and target recovery in regimes where existing state-of-the-art attacks either fail or cannot assess reconstruction fidelity. Compared to prior geometric approaches, VGIA allocates hyperplane queries more effectively, yielding faster reconstructions with fewer attack rounds.