Delta Denoising Score

Amir Hertz, Kfir Aberman, Daniel Cohen-Or,; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 2328-2337

Abstract


This paper introduces Delta Denoising Score (DDS), a novel diffusion-based scoring technique that optimizes a parametric model for the task of image editing. Unlike the existing Score Distillation Sampling (SDS), which queries the generative model with a single image-text pair, DDS utilizes an additional fixed query of a reference image-text pair to generate delta scores that represent the difference between the outputs of the two queries. By estimating noisy gradient directions introduced by SDS using the source image and its text description, DDS provides cleaner gradient directions that modify the edited portions of the image while leaving others unchanged, yielding a distilled edit of the source image. The analysis presented in this paper supports the power of the new score for image-to-image translation. We further show that the new score can be used to train an effective zero-shot image translation model. The experimental results show that the proposed loss term outperforms existing methods in terms of stability and quality, highlighting its potential for real-world applications.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Hertz_2023_ICCV, author = {Hertz, Amir and Aberman, Kfir and Cohen-Or, Daniel}, title = {Delta Denoising Score}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2023}, pages = {2328-2337} }