Easy to Learn, Yet Hard to Forget: Towards Robust Unlearning Under Bias
Machine unlearning, which enables a model to forget specific data, is crucial for ensuring data privacy and model reliability. However, its effectiveness can be severely undermined in real-world scenarios where models learn unintended biases from spurious correlations within the data. This paper investigates the unique challenges of unlearning from such biased models. We identify a novel phenomenon we term “shortcut unlearning," where models exhibit an “easy to learn, yet hard to forget" tendency. Specifically, models struggle to forget […]