🤖 AI Summary
This work addresses the general machine unlearning problem for vision classifiers under diverse data corruptions—including label noise, backdoor attacks, and semantic contamination—by proposing the first cross-corruption comparable two-dimensional conceptual space (discovery rate × statistical regularity) to unify unlearning task modeling. Methodologically, we introduce a novel “redirect–erase” paradigm: structural expansion incorporates dedicated neurons to dynamically capture corruption-specific responses, followed by precise erasure via freezing or pruning; this is complemented by corruption-response isolation and lightweight parameter adaptation. Evaluated across the full spectrum of corruption types, our approach significantly outperforms state-of-the-art methods, achieving an average 27.4% improvement in unlearning success rate and a 41.8% reduction in generalization error. To our knowledge, this is the first framework enabling universal, robust, and interpretable unlearning for vision models.
📝 Abstract
Machine unlearning is studied for a multitude of tasks, but specialization of unlearning methods to particular tasks has made their systematic comparison challenging. To address this issue, we propose a conceptual space to characterize diverse corrupted data unlearning tasks in vision classifiers. This space is described by two dimensions, the discovery rate (the fraction of the corrupted data that are known at unlearning time) and the statistical regularity of the corrupted data (from random exemplars to shared concepts). Methods proposed previously have been targeted at portions of this space and-we show-fail predictably outside these regions. We propose a novel method, Redirection for Erasing Memory (REM), whose key feature is that corrupted data are redirected to dedicated neurons introduced at unlearning time and then discarded or deactivated to suppress the influence of corrupted data. REM performs strongly across the space of tasks, in contrast to prior SOTA methods that fail outside the regions for which they were designed.