Revisiting Privacy Leakage in Machine Unlearning: Membership Inference Beyond the Forgotten Set

πŸ“… 2026-05-01
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

217K/year
πŸ€– AI Summary
This work reveals that machine unlearning can inadvertently heighten privacy risks for retained data, rendering it more susceptible to membership inference attacks. To systematically assess this effect, the authors propose TC-UMIAβ€”the first group-level membership inference attack framework tailored to three distinct sample categories: forgotten, retained, and unseen dataβ€”by analyzing discrepancies in model outputs before and after unlearning. This study extends membership inference attacks to retained data for the first time, uncovering a novel privacy leakage pathway introduced by machine unlearning and establishing a generalizable three-class attack paradigm. Extensive experiments across five state-of-the-art unlearning algorithms and six real-world datasets demonstrate that unlearning significantly increases the inferability of retained samples, with Dropout achieving the best trade-off between privacy preservation and model utility.
πŸ“ Abstract
Machine unlearning (MU) has emerged as a key mechanism for ensuring data privacy and regulatory compliance by enabling models to forget specific training samples. However, recent studies have shown that the removal of data can inadvertently introduce privacy leakages to the retain set,i.e., data that remain in the model after unlearning. In this paper, we extend the scope of privacy analysis in unlearning to the often-overlooked retained data. We introduce TC-UMIA, the first tri-class unlearning membership inference attack. TC-UMIA is a population-level inference framework that leverages model predictions before and after unlearning to distinguish among the forget, retain, and unseen set. Extensive experiments on five state-of-the-art unlearning algorithms and six real-world datasets demonstrate that: (i) unlearning can introduce additional privacy risks to the retain set, making it more susceptible to membership inference attacks; (ii) TC-UMIA is effective across a wide range of model architectures, datasets, and MU approaches. Beyond launching the attack, we rigorously evaluate three defense mechanisms, namely label-only outputs, dropout, and differential privacy, to mitigate the privacy risks posed by TC- UMIA. Our results reveal a fundamental trade-off between privacy protection and model accuracy, with the dropout approach offering the most favorable balance.
Problem

Research questions and friction points this paper is trying to address.

Machine Unlearning
Privacy Leakage
Membership Inference Attack
Retain Set
Data Privacy
Innovation

Methods, ideas, or system contributions that make the work stand out.

machine unlearning
membership inference attack
privacy leakage
retain set
tri-class inference
πŸ”Ž Similar Papers
No similar papers found.