AFPSNet: Multi-Class Part Parsing Based on Scaled Attention and Feature Fusion

Njuod Alsudays, Jing Wu, Yu-Kun Lai, Ze Ji; Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023, pp. 4033-4042

Abstract


Multi-class part parsing is a dense prediction task that seeks to simultaneously detect multiple objects and the semantic parts within these objects in the scene. This problem is important in providing detailed object understanding, but is challenging due to the existence of both class-level and part-level ambiguities. In this paper, we propose to integrate an attention refinement module and a feature fusion module to tackle the part-level ambiguity. The attention refinement module aims to enhance the feature representations by focusing on important features. The feature fusion module aims to improve the fusion operation for different scales of features. We also propose an object-to-part training strategy to tackle the class-level ambiguity, which improves the localization of parts by exploiting prior knowledge of objects. The experimental results demonstrated the effectiveness of the proposed modules and the training strategy, and showed that our proposed method achieved state-of-the-art performance on the benchmark dataset.

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Alsudays_2023_WACV, author = {Alsudays, Njuod and Wu, Jing and Lai, Yu-Kun and Ji, Ze}, title = {AFPSNet: Multi-Class Part Parsing Based on Scaled Attention and Feature Fusion}, booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)}, month = {January}, year = {2023}, pages = {4033-4042} }