Adding Additional Control to One-Step Diffusion with Joint Distribution Matching

Luo, Yihong; Hu, Tianyang; Song, Yifan; Sun, Jiacheng; Li, Zhenguo; Tang, Jing

Yihong Luo, Tianyang Hu, Yifan Song, Jiacheng Sun, Zhenguo Li, Jing Tang; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 4009-4018

Abstract

While diffusion distillation has enabled one-step generation through methods like Variational Score Distillation, adapting distilled models to emerging *new controls* -- such as novel structural constraints or latest user preferences -- remains challenging. Conventional approaches typically requires modifying the base diffusion model and redistilling it -- a process that is both computationally intensive and time-consuming. To address these challenges, we introduce Joint Distribution Matching (JDM), a novel approach that minimizes the reverse KL divergence between image-condition joint distributions. By deriving a tractable upper bound, JDM decouples fidelity learning from condition learning. This asymmetric distillation scheme enables our one-step student to handle controls unknown to the teacher model and facilitates improved classifier-free guidance (CFG) usage and seamless integration of human feedback learning (HFL). Experimental results demonstrate that JDM surpasses baseline methods such as multi-step ControlNet by mere one-step in most cases, while achieving state-of-the-art performance in one-step text-to-image synthesis through improved usage of CFG or HFL integration.

Related Material

[pdf] [supp] [arXiv]

[bibtex]

@InProceedings{Luo_2025_ICCV, author = {Luo, Yihong and Hu, Tianyang and Song, Yifan and Sun, Jiacheng and Li, Zhenguo and Tang, Jing}, title = {Adding Additional Control to One-Step Diffusion with Joint Distribution Matching}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2025}, pages = {4009-4018} }