Backbone Based Feature Enhancement for Object Detection

Haoqin Ji, Weizeng Lu, Linlin Shen; Proceedings of the Asian Conference on Computer Vision (ACCV), 2020


FPN (Feature Pyramid Networks) and many of its variants have been widely used in state of the art object detectors and made remarkable progress in detection performance. However, almost all the architectures of feature pyramid are manually designed, which requires ad hoc design and prior knowledge. Meanwhile, existing methods focus on exploring more appropriate connections to generate features with strong semantics features from inherent pyramidal hierarchy of deep ConvNets (Convolutional Networks). In this paper, we propose a simple but effective approach, named BBFE (Backbone Based Feature Enhancement), to directly enhance the semantics of shallow features from backbone ConvNets. The proposed BBFE consists of two components: reusing backbone weight and personalized feature enhancement. We also proposed a fast version of BBFE, named Fast-BBFE, to achieve better trade-off between efficiency and accuracy. Without bells and whistles, our BBFE improves different baseline methods (both anchor-based and anchor-free) by a large margin ( 2.0 points higher AP) on COCO, surpassing common feature pyramid networks including FPN and PANet.

Related Material

@InProceedings{Ji_2020_ACCV, author = {Ji, Haoqin and Lu, Weizeng and Shen, Linlin}, title = {Backbone Based Feature Enhancement for Object Detection}, booktitle = {Proceedings of the Asian Conference on Computer Vision (ACCV)}, month = {November}, year = {2020} }