-
[pdf]
[supp]
[bibtex]@InProceedings{Thanyawet_2025_WACV, author = {Thanyawet, Narongthat and Ratsamee, Photchara and Uranishi, Yuki and Takemura, Haruo}, title = {Detective Networks: Enhancing Disaster Recognition in Images Through Attention Shifting using Optimal Masking}, booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)}, month = {February}, year = {2025}, pages = {8723-8732} }
Detective Networks: Enhancing Disaster Recognition in Images Through Attention Shifting using Optimal Masking
Abstract
Aerial investigation is used for surveying damage and identifying post-disaster events through imagery data. However the challenge lies in detecting disaster-related areas within aerial or shipborne images as these can appear as minor regions making recognition difficult. To address this challenge we introduce the Detective Network (DeNet) designed to optimally mask images thereby shifting the attention of machine learning models towards these small yet crucial regions. Utilizing the concepts of patch and anchor box DeNet incorporates a masking candidate layer and a masking layer to facilitate optimal masking. Our experimental findings are compelling; by preprocessing images with DeNet before analysis using an image captioning model we achieved a remarkable accuracy of 92.91% in landslide detection from side-view image captions and 87.50% for shipborne view detection. The result demonstrates the efficacy of DeNet in enhancing the recognition of disaster-related areas in challenging imaging conditions.
Related Material