Improving Shape Awareness and Interpretability in Deep Networks Using Geometric Moments

Rajhans Singh, Ankita Shukla, Pavan Turaga; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2023, pp. 4159-4168

Abstract


Deep networks for image classification often rely more on texture information than object shape. While efforts have been made to make deep-models shape-aware, it is often difficult to make such models simple, interpretable, or rooted in known mathematical definitions of shape. This paper presents a deep-learning model inspired by geometric moments, a classically well understood approach to measure shape-related properties. The proposed method consists of a trainable network for generating coordinate bases and affine parameters for making the features geometrically invariant yet in a task-specific manner. The proposed model improves the final feature's interpretation. We demonstrate the effectiveness of our method on standard image classification datasets. The proposed model achieves higher classification performance compared to the baseline and standard ResNet models while substantially improving interpretability.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Singh_2023_CVPR, author = {Singh, Rajhans and Shukla, Ankita and Turaga, Pavan}, title = {Improving Shape Awareness and Interpretability in Deep Networks Using Geometric Moments}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2023}, pages = {4159-4168} }