Inverting the Generation Process of Denoising Diffusion Implicit Models: Empirical Evaluation and a Novel Method

Yan Zeng, Masanori Suganuma, Takayuki Okatani; Proceedings of the Winter Conference on Applications of Computer Vision (WACV), 2025, pp. 4616-4624

Abstract


This paper studies the problem of inverting the DDIM image generation process to recover latent variables particularly the initial noise map from a generated image. Existing methods often struggle with accuracy in this task. We propose a novel hybrid approach that combines direct inversion via gradient descent for the first step followed by a fixed-point method for subsequent steps. Empirical evaluations across three datasets demonstrate that our method significantly improves the prediction of initial latent variables while achieving superior reconstruction accuracy. Additionally we introduce a new evaluation called the self-interpolation test which assesses the quality of images generated from interpolated points between the true and predicted latent maps offering deeper insights into performance. Our results reveal that while existing methods perform reasonably well in reconstruction they consistently fail to accurately predict the initial latent variables resulting in poor performance on the self-interpolation test. In contrast our method outperforms all others across all metrics providing valuable insights into diffusion models and enhancing their applications in image generation and editing.

Related Material


[pdf]
[bibtex]
@InProceedings{Zeng_2025_WACV, author = {Zeng, Yan and Suganuma, Masanori and Okatani, Takayuki}, title = {Inverting the Generation Process of Denoising Diffusion Implicit Models: Empirical Evaluation and a Novel Method}, booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)}, month = {February}, year = {2025}, pages = {4616-4624} }