Evaluating Model Perception of Color Illusions in Photorealistic Scenes

Lingjun Mao, Zineng Tang, Alane Suhr; Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), 2025, pp. 7805-7814

Abstract


We study the perception of color illusions by vision-language models. Color illusion, where a person's visual system perceives color differently from actual color, is well-studied in human vision. However, it remains underexplored whether vision-language models (VLMs), trained on large-scale human data, exhibit similar perceptual biases when confronted with such color illusions. We propose an automated framework for generating color illusion images, resulting in RCID (Realistic Color Illusion Dataset), a dataset of 19,000 realistic illusion images. Our experiments show that all studied VLMs exhibit perceptual biases similar human vision. Finally, we train a model to distinguish both human perception and actual pixel differences.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Mao_2025_CVPR, author = {Mao, Lingjun and Tang, Zineng and Suhr, Alane}, title = {Evaluating Model Perception of Color Illusions in Photorealistic Scenes}, booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)}, month = {June}, year = {2025}, pages = {7805-7814} }