Towards Synthetic Concept Activation Vectors via Generative Models

Campi, Riccardo; Borrego, Santiago; De Santis, Antonio; Bianchi, Matteo; Tocchetti, Andrea; Brambilla, Marco

Riccardo Campi, Santiago Borrego, Antonio De Santis, Matteo Bianchi, Andrea Tocchetti, Marco Brambilla; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2025, pp. 2745-2753

Abstract

In recent years, the field of Explainable Artificial Intelligence (XAI) has developed a new paradigm called concept-based XAI, which fosters using human-understandable concepts to verify hypotheses or test models against biases. A Concept Activation Vector (CAV) is the representation of a concept in a vision model's embedding space. However, training a CAV requires a properly sized dataset containing images of the selected concept. This may represent a limitation, as acquiring such a dataset can be difficult and time-consuming. In this context, a text-to-image generation system may help analysts build CAVs directly from texts, reducing time requirements while maintaining faithfulness. This work lays the foundations of synthetic CAV generation using pre-trained text-to-image generative models. Our approach consists of producing synthetic concept images from a descriptive prompt and using them to train a CAV in the space of a pre-trained vision model. The methodology also includes a quality control step with a multi-modal embedding model to discard images containing errors or artifacts. We evaluate the quality of our proposal by running experiments on popular ImageNet CNNs using a set of randomly chosen concepts and then comparing the synthetic CAVs with the ones from real images. Our results show that it is possible to train faithful CAVs by generating concept images, particularly for simpler concepts such as textures. Fine-tuning generative models with a few real images also yields promising results.

Related Material

[pdf] [supp]

[bibtex]

@InProceedings{Campi_2025_CVPR, author = {Campi, Riccardo and Borrego, Santiago and De Santis, Antonio and Bianchi, Matteo and Tocchetti, Andrea and Brambilla, Marco}, title = {Towards Synthetic Concept Activation Vectors via Generative Models}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2025}, pages = {2745-2753} }