AttZoom: Attention Zoom for Better Visual Features

Daniel DeAlcala, Aythami Morales, Julian Fierrez, Ruben Tolosana; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2025, pp. 4893-4902

Abstract


We present Attention Zoom, a modular and model-agnostic spatial attention mechanism designed to improve feature extraction in convolutional neural networks (CNNs). Unlike traditional attention approaches that require architecture-specific integration, our method introduces a standalone layer that spatially emphasizes high-importance regions in the input. We evaluated Attention Zoom on multiple CNN backbones using CIFAR-100 and TinyImageNet, showing consistent improvements in Top-1 and Top-5 classification accuracy. Visual analyses using Grad-CAM and spatial warping reveal that our method encourages fine-grained and diverse attention patterns. Our results confirm the effectiveness and generality of the proposed layer for improving CCNs with minimal architectural overhead.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{DeAlcala_2025_ICCV, author = {DeAlcala, Daniel and Morales, Aythami and Fierrez, Julian and Tolosana, Ruben}, title = {AttZoom: Attention Zoom for Better Visual Features}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops}, month = {October}, year = {2025}, pages = {4893-4902} }