Machine unlearning—the ability to remove designated concepts from a pre-trained model—has advanced rapidly, particularly for text-to-image diffusion models. However, existing methods typically assume that unlearning requests arrive all at once, whereas in practice they often arrive sequentially. We present the first systematic study of continual unlearning in text-to-image diffusion models and show that popular unlearning methods suffer from rapid utility collapse: after only a few requests, models forget retained knowledge and generate degraded images. We trace this failure to cumulative parameter drift from the pre-training weights and argue that regularization is crucial to addressing it. To this end, we study a suite of add-on regularizers that (1) mitigate drift and (2) remain compatible with existing unlearning methods. Beyond generic regularizers, we show that semantic awareness is essential for preserving concepts close to the unlearning target, and propose a gradient-projection method that constrains parameter updates to be orthogonal to the subspace spanned by those concepts' embeddings. This substantially improves continual unlearning performance and is complementary to other regularizers for further gains. Taken together, our study establishes continual unlearning as a fundamental challenge in text-to-image generation and provides insights, baselines, and open directions for advancing safe and accountable generative AI.
This figure illustrates what success should look like. When a concept is unlearned, only that concept should disappear; other concepts should remain intact. After the first unlearning step, the image for "Cartoon" should remain conceptually unchanged, while the image for "Van Gogh" should no longer exhibit the "Van Gogh" style. After the second unlearning step, both the "Van Gogh" and "Cartoon" styles should be removed, while the concept "cat" should be retained.
For both style (a) and object (b) unlearning, sequentially removing 12 concepts severely degrades utility (retention). In contrast, restarting from the base model for each request and unlearning all requests simultaneously better preserve cross-domain knowledge (e.g., retaining objects after removing styles). However, simultaneous unlearning remains prohibitively expensive, as shown in the next figure.
Simultaneous Unlearning is Prohibitively Expensive. As unlearning requests accumulate, the total training steps grow superlinearly under simultaneous unlearning, while remaining near linear under sequential unlearning. We therefore analyze why simultaneous unlearning performs better (next figure), with the goal of improving sequential unlearning.
As more concepts are removed, sequential unlearning incurs substantial cumulative parameter drift relative to the pretrained model, far greater than with simultaneous unlearning. We formalize the relationship between parameter drift and utility preservation in Section 5.2. We therefore benchmark different add-on regularizers that mitigate this drift, as illustrated in the next figure.
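To make the notion of drift concrete, here is a minimal sketch of how cumulative parameter drift relative to the pretrained model could be measured; the function name and interface are illustrative, not the paper's implementation.

```python
import torch


def parameter_drift(model: torch.nn.Module, base_state: dict) -> float:
    """L2 distance between the current parameters and the pretrained
    weights (stored as a name -> tensor dict), aggregated over all
    parameters. Tracked after each unlearning request, this quantifies
    how far the model has drifted from the base checkpoint."""
    total = 0.0
    for name, p in model.named_parameters():
        total += (p.detach() - base_state[name]).pow(2).sum().item()
    return total ** 0.5
```

Logging this quantity after every request reproduces the qualitative trend above: sequential unlearning accumulates drift, while simultaneous unlearning stays close to the base model.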
We provide baselines with three plug-and-play add-on regularizers, compatible with existing unlearning methods, to constrain parameter drift. (a) L1/L2 penalizes the norm of the parameter update relative to the previous checkpoint. (b) Selective Fine-tuning restricts updates to the top-k% most important parameters. (c) Model Merging unlearns each concept independently and then combines the resulting models.
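The L1/L2 regularizer in (a) can be sketched as a loss term added on top of any unlearning objective; the coefficients below are illustrative placeholders, not tuned values from the paper.

```python
import torch


def drift_penalty(model: torch.nn.Module, ref_state: dict,
                  l1: float = 0.0, l2: float = 1e-3) -> torch.Tensor:
    """Add-on penalty on the parameter update relative to a reference
    checkpoint (the previous checkpoint in sequential unlearning).
    Returned as a differentiable scalar to be added to the unlearning loss."""
    penalty = torch.zeros(())
    for name, p in model.named_parameters():
        delta = p - ref_state[name]
        penalty = penalty + l1 * delta.abs().sum() + l2 * delta.pow(2).sum()
    return penalty


# usage (schematic): loss = unlearning_loss + drift_penalty(model, prev_ckpt)
```

Because the penalty is a plain additive loss term, it composes with any existing unlearning method without modifying that method's update rule.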
Add-on Regularizers Improve Retention. Compared with baseline sequential unlearning, all add-on regularizers improve utility preservation. Retaining concepts within the same domain, for example other styles when removing styles, remains far more challenging than preserving cross-domain concepts, for example objects when removing styles. Our newly proposed regularizer (Grad Proj) achieves the strongest in-domain retention by accounting for semantic interference, with details shown in the next figure.
Semantically Similar Concepts are Harder to Preserve. Concepts with greater text-embedding cosine similarity to the unlearned concepts (e.g., "Abstractionism" style) experience greater utility degradation.
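The similarity measure underlying this observation is cosine similarity between text embeddings. A minimal sketch, assuming some `embed_fn` that maps a prompt to a 1-D text embedding (e.g., a text encoder's pooled output; the function and its interface are hypothetical):

```python
import torch
import torch.nn.functional as F


def concept_similarity(embed_fn, target: str, retained: list) -> dict:
    """Cosine similarity between the unlearning target's text embedding
    and each retained concept's embedding. Higher similarity predicts
    greater utility degradation for that retained concept."""
    t = embed_fn(target)
    return {c: F.cosine_similarity(t, embed_fn(c), dim=0).item()
            for c in retained}
```

Ranking retained concepts by this score identifies which ones are most at risk when a given target is unlearned.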
Change in Cross-Attention Keys and Values Increases with Semantic Similarity. In UNet diffusion models, cross-attention captures the relationship between text prompts (k, v) and image features (q). As text-embedding cosine similarity increases, the k and v outputs become more distorted.
Gradient Projection. Motivated by these findings, we project the gradients of the key and value matrices to be orthogonal to concepts with high text-embedding cosine similarity to the concept to be unlearned, achieving the strongest in-domain utility preservation among all regularizers.
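The core of gradient projection can be sketched as follows: for a key or value weight `W` (shape `out_dim x d`) and a set of protected text embeddings (rows of an `m x d` matrix), removing the gradient component that acts on the protected subspace leaves the outputs `W @ e` unchanged for every protected embedding `e`. This is a minimal sketch under those assumptions, not the authors' exact implementation.

```python
import torch


def project_grad(grad: torch.Tensor, protected: torch.Tensor) -> torch.Tensor:
    """Project the gradient of a cross-attention K/V weight (out_dim x d)
    orthogonally to the subspace spanned by protected text embeddings
    (m x d), so the update cannot change W @ e for any protected e."""
    E = protected.T                    # d x m: protected subspace basis
    P = E @ torch.linalg.pinv(E)       # d x d: projector onto span(E)
    return grad - grad @ P             # strip the component acting on span(E)
```

Since the projection acts only on the gradient, it stacks directly on top of any unlearning objective, which is why it combines cleanly with the other add-on regularizers.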
Add-on Regularizers Are Combinable. Gradient Projection can be combined with other add-on regularizers, such as selective fine-tuning, to achieve further performance gains.
Understanding how robustness to adversarial recovery attacks evolves across sequential unlearning steps is critical for safe deployment, particularly as it remains unclear whether these challenges compound differently across architectures (e.g., DiT), training objectives (e.g., flow matching), and modalities (e.g., video, speech) beyond diffusion-based image generation. While our plug-and-play regularizers provide a strong foundation, designing natively sequential unlearning methods that anticipate future requests and account for their interactions is a natural next step toward further advancing continual unlearning.
@inproceedings{lee2026continual,
title={Continual Unlearning for Text-to-Image Diffusion Models: A Regularization Perspective},
author={Lee, Justin and Mai, Zheda and Yoo, Jinsu and Fan, Chongyu and Zhang, Cheng and Chao, Wei-Lun},
booktitle={International Conference on Learning Representations (ICLR)},
year={2026}
}