-
[pdf]
[arXiv]
[bibtex]@InProceedings{Echevarrieta-Catalan_2025_CVPR, author = {Echevarrieta-Catalan, Nicolas and Ribas-Rodriguez, Ana and Cedron, Francisco and Schwartz, Odelia and Aguiar-Pulido, Vanessa}, title = {Enhancing Vision Transformer Explainability Using Artificial Astrocytes}, booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) Workshops}, month = {June}, year = {2025}, pages = {58-64} }
Enhancing Vision Transformer Explainability Using Artificial Astrocytes
Abstract
Machine learning models achieve high precision, but their decision-making processes often lack explainability. Furthermore, as model complexity increases, explainability typically decreases. Existing efforts to improve explainability primarily involve developing new eXplainable artificial intelligence (XAI) techniques or incorporating explainability constraints during training. While these approaches yield specific improvements, their applicability remains limited. In this work, we propose the Vision Transformer with artificial Astrocytes (ViTA). This training-free approach is inspired by neuroscience and enhances the reasoning of a pretrained deep neural network to generate more human-aligned explanations. We evaluated our approach employing two well-known XAI techniques, Grad-CAM and Grad-CAM++, and compared it to a standard Vision Transformer (ViT). Using the ClickMe dataset, we quantified the similarity between the heatmaps produced by the XAI techniques and a (human-aligned) ground truth. Our results consistently demonstrate that incorporating artificial astrocytes enhances the alignment of model explanations with human perception, leading to statistically significant improvements across all XAI techniques and metrics utilized.
Related Material