-
[pdf]
[supp]
[bibtex]@InProceedings{Yang_2025_ICCV, author = {Yang, Jiahui and Ma, Yongjia and Di, Donglin and Cui, Jianxun and Li, Hao and Chen, Wei and Xie, Yan and Yang, Xun and Zuo, Wangmeng}, title = {QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2025}, pages = {17587-17597} }
QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation
Abstract
Existing text-to-image models often rely on parame- ter fine-tuning techniques such as Low-Rank Adaptation (LoRA) to customize visual attributes. However, when com- bining multiple LoRA models for content-style fusion tasks, unstructured modifications of weight matrices often lead to undesired feature entanglement between content and style attributes. We propose QR-LoRA, a novel fine-tuning frame- work leveraging QR decomposition for structured parame- ter updates that effectively separate visual attributes. Our key insight is that the orthogonal Q matrix naturally min- imizes interference between different visual features, while the upper triangular R matrix efficiently encodes attribute- specific transformations. Our approach fixes both Q and R matrices while only training an additional task-specific R matrix. This structured design reduces trainable param- eters to half of conventional LoRA methods and supports effective merging of multiple adaptations without cross- contamination due to the strong disentanglement properties between R matrices. Experiments demonstrate that QR- LoRA achieves superior disentanglement in content-style fusion tasks, establishing a new paradigm for parameter- efficient, disentangled fine-tuning in generative models. The project page is available at: https://luna-ai-lab. github.io/QR-LoRA/
Related Material
