Beyond the Limitation of Monocular 3D Detector via Knowledge Distillation

Yang, Yiran; Yin, Dongshuo; Rong, Xuee; Sun, Xian; Diao, Wenhui; Li, Xinming

Yiran Yang, Dongshuo Yin, Xuee Rong, Xian Sun, Wenhui Diao, Xinming Li; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 9077-9086

Abstract

Knowledge distillation (KD) is a promising approach that helps a compact student model learn dark knowledge from a large teacher model and achieve better results. Although KD methods are well explored for 2D detection, existing approaches are ill-suited to monocular 3D detection because they do not consider spatial cues. Motivated by the potential of depth information, we propose a novel distillation framework that effectively improves the performance of the student model without extra depth labels. Specifically, we first put forward a perspective-induced feature imitation, which uses the perspective principle (the farther, the smaller) to encourage the student to imitate more features of farther objects from the teacher model. Moreover, we construct a depth-guided matrix from the gap between the depths predicted by the teacher and the student, so that the model learns more about farther objects in prediction-level distillation. The proposed method is applicable to advanced monocular detectors with various backbones and adds no inference time. Extensive experiments on the KITTI and nuScenes benchmarks with diverse settings demonstrate that the proposed method outperforms state-of-the-art KD methods.
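The two ideas in the abstract can be illustrated with a minimal NumPy sketch. It is not the paper's formulation: the function names, the linear depth weighting, and the normalisation are all assumptions made for illustration, and the abstract does not specify the actual loss terms. The first function weights a feature-imitation loss so that farther foreground pixels count more; the second turns the teacher-student depth gap into per-object weights for a prediction-level loss.

```python
import numpy as np


def depth_weighted_feature_loss(student_feat, teacher_feat, depth_map, fg_mask):
    """Squared feature gap, weighted so farther foreground pixels count more.

    Hypothetical illustration only; the weighting scheme is an assumption.
    student_feat, teacher_feat: arrays of shape (C, H, W).
    depth_map: array of shape (H, W) with a (predicted) depth per pixel.
    fg_mask: boolean array of shape (H, W) marking object pixels.
    """
    student_feat = np.asarray(student_feat, dtype=float)
    teacher_feat = np.asarray(teacher_feat, dtype=float)
    # Assumed weighting: proportional to depth on foreground, zero elsewhere.
    weights = np.where(fg_mask, np.asarray(depth_map, dtype=float), 0.0)
    total = weights.sum()
    if total == 0:
        return 0.0
    weights = weights / total
    # Per-pixel squared error, averaged over channels.
    sq_err = ((student_feat - teacher_feat) ** 2).mean(axis=0)
    return float((weights * sq_err).sum())


def depth_gap_weights(student_depth, teacher_depth):
    """Per-object weights from the absolute teacher-student depth gap.

    Hypothetical illustration only; weights are normalised to sum to 1.
    """
    gap = np.abs(np.asarray(teacher_depth, dtype=float)
                 - np.asarray(student_depth, dtype=float))
    total = gap.sum()
    if total == 0:
        # No disagreement: fall back to uniform weights.
        return np.full(gap.shape, 1.0 / gap.size)
    return gap / total
```

Under these assumptions, a feature mismatch at a pixel 30 m away contributes three times as much to the loss as the same mismatch at 10 m, and an object whose depth the student misjudges more receives a larger share of the prediction-level weight.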

Related Material

[bibtex]

@InProceedings{Yang_2023_ICCV,
    author    = {Yang, Yiran and Yin, Dongshuo and Rong, Xuee and Sun, Xian and Diao, Wenhui and Li, Xinming},
    title     = {Beyond the Limitation of Monocular 3D Detector via Knowledge Distillation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2023},
    pages     = {9077-9086}
}