-
[pdf]
[arXiv]
[bibtex]@InProceedings{DeAlcala_2025_ICCV, author = {DeAlcala, Daniel and Morales, Aythami and Fierrez, Julian and Tolosana, Ruben}, title = {AttZoom: Attention Zoom for Better Visual Features}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops}, month = {October}, year = {2025}, pages = {4893-4902} }
AttZoom: Attention Zoom for Better Visual Features
Abstract
We present Attention Zoom, a modular and model-agnostic spatial attention mechanism designed to improve feature extraction in convolutional neural networks (CNNs). Unlike traditional attention approaches that require architecture-specific integration, our method introduces a standalone layer that spatially emphasizes high-importance regions in the input. We evaluated Attention Zoom on multiple CNN backbones using CIFAR-100 and TinyImageNet, showing consistent improvements in Top-1 and Top-5 classification accuracy. Visual analyses using Grad-CAM and spatial warping reveal that our method encourages fine-grained and diverse attention patterns. Our results confirm the effectiveness and generality of the proposed layer for improving CCNs with minimal architectural overhead.
Related Material
