FastCLIPstyler: Optimisation-Free Text-Based Image Style Transfer Using Style Representations

Ananda Padhmanabhan Suresh, Sanjana Jain, Pavit Noinongyao, Ankush Ganguly, Ukrit Watchareeruetai, Aubin Samacoits; Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024, pp. 7316-7325

Abstract


In recent years, language-driven artistic style transfer has emerged as a new type of style transfer technique, eliminating the need for a reference style image by using natural language descriptions of the style. The first model to achieve this, called CLIPstyler, has demonstrated impressive stylisation results. However, its lengthy optimisation procedure at runtime for each query limits its suitability for many practical applications. In this work, we present FastCLIPstyler, a generalised text-based image style transfer model capable of stylising images in a single forward pass for arbitrary text inputs. Furthermore, we introduce EdgeCLIPstyler, a lightweight model designed for compatibility with resource-constrained devices. Through quantitative and qualitative comparisons with state-of-the-art approaches, we demonstrate that our models achieve superior stylisation quality based on measurable metrics while offering significantly improved runtime efficiency, particularly on edge devices.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Suresh_2024_WACV, author = {Suresh, Ananda Padhmanabhan and Jain, Sanjana and Noinongyao, Pavit and Ganguly, Ankush and Watchareeruetai, Ukrit and Samacoits, Aubin}, title = {FastCLIPstyler: Optimisation-Free Text-Based Image Style Transfer Using Style Representations}, booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)}, month = {January}, year = {2024}, pages = {7316-7325} }