SpotDiffusion: A Fast Approach for Seamless Panorama Generation Over Time

Stanislav Frolov, Brian B. Moser, Andreas Dengel; Proceedings of the Winter Conference on Applications of Computer Vision (WACV), 2025, pp. 2073-2081

Abstract


Generating high-resolution images with generative models has recently been made widely accessible by leveraging diffusion models pre-trained on large-scale datasets. Various techniques such as MultiDiffusion and SyncDiffusion have further pushed image generation beyond training resolutions i.e. from square images to panorama by merging multiple overlapping diffusion paths or employing gradient descent to maintain perceptual coherence. However these methods suffer from significant computational inefficiencies due to generating and averaging numerous predictions which is required in practice to produce high-quality and seamless images. This work addresses this limitation and presents a novel approach that eliminates the need to generate and average numerous overlapping denoising predictions. Our method shifts non-overlapping denoising windows over time ensuring that seams in one timestep are corrected in the next. This results in coherent high-resolution images with fewer overall steps. We demonstrate the effectiveness of our approach through qualitative and quantitative evaluations comparing it with MultiDiffusion SyncDiffusion and StitchDiffusion. Our method offers several key benefits including improved computational efficiency and faster inference times while producing comparable or better image quality.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Frolov_2025_WACV, author = {Frolov, Stanislav and Moser, Brian B. and Dengel, Andreas}, title = {SpotDiffusion: A Fast Approach for Seamless Panorama Generation Over Time}, booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)}, month = {February}, year = {2025}, pages = {2073-2081} }