MOSAIC: Multi-Object Segmented Arbitrary Stylization Using CLIP

Ganugula, Prajwal; Kumar, Y S S S Santosh; Reddy, N K Sagar; Chellingi, Prabhath; Thakur, Avinash; Kasera, Neeraj; Anand, C Shyam

Prajwal Ganugula, Y S S S Santosh Kumar, N K Sagar Reddy, Prabhath Chellingi, Avinash Thakur, Neeraj Kasera, C Shyam Anand; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2023, pp. 892-903

Abstract

Style transfer driven by text prompts paved a new path for creatively stylizing the images without collecting an actual style image. Despite having promising results, with text-driven stylization, the user has no control over the stylization. If a user wants to create an artistic image, the user requires fine control over the stylization of various entities individually in the content image, which is not addressed by the current state-of-the-art approaches. On the other hand, diffusion style transfer methods also suffer from the same issue because the regional stylization control over the stylized output is ineffective. To address this problem, We propose a new method Multi-Object Segmented Arbitrary Stylization Using CLIP (MOSAIC), that can apply styles to different objects in the image based on the context extracted from the input prompt. Text-based segmentation and stylization modules which are based on vision transformer architecture, were used to segment and stylize the objects. Our method can extend to any arbitrary objects, styles and produce high-quality images compared to the current state of art methods. To our knowledge, this is the first attempt to perform text-guided arbitrary object-wise stylization. We demonstrate the effectiveness of our approach through qualitative and quantitative analysis, showing that it can generate visually appealing stylized images with enhanced control over stylization and the ability to generalize to unseen object classes.

Related Material

[pdf] [arXiv]

[bibtex]

@InProceedings{Ganugula_2023_ICCV, author = {Ganugula, Prajwal and Kumar, Y S S S Santosh and Reddy, N K Sagar and Chellingi, Prabhath and Thakur, Avinash and Kasera, Neeraj and Anand, C Shyam}, title = {MOSAIC: Multi-Object Segmented Arbitrary Stylization Using CLIP}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops}, month = {October}, year = {2023}, pages = {892-903} }