Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models

Adrien Le Coz, Houssem Ouertatani, Stéphane Herbin, Faouzi Adjed; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 7569-7578

Abstract


Image classifiers should be used with caution in the real world. Performance evaluated on a validation set may not reflect performance in the real world. In particular classifiers may perform well for conditions that are frequently encountered during training but poorly for other infrequent conditions. In this study we hypothesize that recent advances in text-to-image generative models make them valuable for benchmarking computer vision models such as image classifiers: they can generate images conditioned by textual prompts that cause classifier failures allowing failure conditions to be described with textual attributes. However their generation cost becomes an issue when a large number of synthetic images need to be generated which is the case when many different attribute combinations need to be tested. We propose an image classifier benchmarking method as an iterative process that alternates image generation classifier evaluation and attribute selection. This method efficiently explores the attributes that ultimately lead to poor behavior detection.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Le_Coz_2024_CVPR, author = {Le Coz, Adrien and Ouertatani, Houssem and Herbin, St\'ephane and Adjed, Faouzi}, title = {Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2024}, pages = {7569-7578} }