Using Large Text To Image Models with Structured Prompts for Skin Disease Identification: A Case Study

Rajapaksa, Sajith; Vianney, Jean Marie Uwabeza; Castro, Renell; Khalvati, Farzad; Aich, Shubhra

Sajith Rajapaksa, Jean Marie Uwabeza Vianney, Renell Castro, Farzad Khalvati, Shubhra Aich; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2023, pp. 2694-2701

Abstract

This paper investigates the potential usage of large text-to-image (LTI) models for the automated diagnosis of a few skin conditions with rarity or a serious lack of annotated datasets. As the input to the LTI model, we provide the targeted instantiation of a generic but succinct prompt structure designed upon careful observations of the conditional narratives from the standard medical textbooks. In this regard, we pave the path to utilizing accessible textbook descriptions for automated diagnosis of conditions with data scarcity through the lens of LTI models. Experiments show the efficacy of the proposed framework, including much better localization of the infected regions. Moreover, it has the immense possibility for generalization across the medical sub-domains, not only to mitigate the data scarcity issue but also to debias automated diagnostics from the all-pervasive racial biases.

Related Material

[pdf] [arXiv]

[bibtex]

@InProceedings{Rajapaksa_2023_ICCV, author = {Rajapaksa, Sajith and Vianney, Jean Marie Uwabeza and Castro, Renell and Khalvati, Farzad and Aich, Shubhra}, title = {Using Large Text To Image Models with Structured Prompts for Skin Disease Identification: A Case Study}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops}, month = {October}, year = {2023}, pages = {2694-2701} }