DiffuseKronA: A Parameter Efficient Fine-Tuning Method for Personalized Diffusion Models

Shyam Marjit, Harshit Singh, Nityanand Mathur, Sayak Paul, Chia-Mu Yu, Pin-Yu Chen; Proceedings of the Winter Conference on Applications of Computer Vision (WACV), 2025, pp. 3529-3538

Abstract


In the realm of subject-driven text-to-image (T2I) generative models recent developments like DreamBooth and BLIP-Diffusion have led to impressive results yet encounter limitations due to their intensive fine-tuning demands and substantial parameter requirements. While the low-rank adaptation (LoRA) module within DreamBooth offers a reduction in trainable parameters it introduces a pronounced sensitivity to hyperparameters leading to a compromise between parameter efficiency and the quality of T2I personalized image synthesis. Addressing these constraints we introduce DiffuseKronA a novel Kronecker product-based adaptation module that not only significantly reduces the parameter count by 35% and 99.947% compared to LoRA-DreamBooth and the original DreamBooth respectively but also enhances the quality of image synthesis. Crucially DiffuseKronA mitigates the issue of hyperparameter sensitivity delivering consistent high-quality generations across a wide range of hyperparameters thereby diminishing the necessity for extensive fine-tuning. Furthermore a more controllable decomposition makes DiffuseKronA more interpretable and can even achieve up to a 50% reduction with results comparable to LoRA-Dreambooth. Evaluated against diverse and complex input images and text prompts DiffuseKronA consistently outperforms existing low-rank models producing diverse images of higher quality with improved fidelity and a more accurate color distribution of objects all the while upholding exceptional parameter efficiency thus presenting a substantial advancement in the field of T2I generative modeling.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Marjit_2025_WACV, author = {Marjit, Shyam and Singh, Harshit and Mathur, Nityanand and Paul, Sayak and Yu, Chia-Mu and Chen, Pin-Yu}, title = {DiffuseKronA: A Parameter Efficient Fine-Tuning Method for Personalized Diffusion Models}, booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)}, month = {February}, year = {2025}, pages = {3529-3538} }