Unified Concept Editing in Diffusion Models

Gandikota, Rohit; Orgad, Hadas; Belinkov, Yonatan; Materzyńska, Joanna; Bau, David

Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzyńska, David Bau; Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024, pp. 5111-5120

Abstract

Text-to-image models suffer from various safety issues that may limit their suitability for deployment. Previous methods have separately addressed individual issues of bias, copyright, and offensive content in text-to-image models. However, in the real world, all of these issues appear simultaneously in the same model. We present a method that tackles all issues with a single approach. Our method, Unified Concept Editing (UCE), edits the model without training using a closed-form solution, and scales seamlessly to concurrent edits on text-conditional diffusion models. We demonstrate scalable simultaneous debiasing, style erasure, and content moderation by editing text-to-image projections, and we present extensive experiments demonstrating improved efficacy and scalability over prior work.

Related Material

[pdf] [supp] [arXiv]

[bibtex]

@InProceedings{Gandikota_2024_WACV, author = {Gandikota, Rohit and Orgad, Hadas and Belinkov, Yonatan and Materzy\'nska, Joanna and Bau, David}, title = {Unified Concept Editing in Diffusion Models}, booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)}, month = {January}, year = {2024}, pages = {5111-5120} }