Rethinking Feature-Based Knowledge Distillation for Face Recognition

Jingzhi Li, Zidong Guo, Hui Li, Seungju Han, Ji-won Baek, Min Yang, Ran Yang, Sungjoo Suh; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 20156-20165

Abstract


With the continual expansion of face datasets, feature-based distillation has become the prevailing approach for large-scale face recognition. In this work, we attempt to remove identity supervision in student training, to spare GPU memory from storing massive class centers. However, this naive removal leads to inferior distillation results. We carefully inspect the performance degradation from the perspective of intrinsic dimension, and argue that the gap in intrinsic dimension, namely the intrinsic gap, is intimately connected to the infamous capacity gap problem. By constraining the teacher's search space with reverse distillation, we narrow the intrinsic gap and unleash the potential of feature-only distillation. Remarkably, the proposed reverse distillation creates a universally student-friendly teacher that delivers outstanding student improvements. We further enhance its effectiveness by designing a student proxy to better bridge the intrinsic gap. As a result, the proposed method surpasses state-of-the-art distillation techniques with identity supervision on various face recognition benchmarks, and the improvements are consistent across different teacher-student pairs.
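To make the memory argument concrete, the sketch below illustrates feature-only distillation in general terms: the student is trained solely to match the frozen teacher's embeddings, so no identity classification head (and hence no per-identity class-center matrix) is kept on the GPU. This is not the authors' code; the loss choice (MSE between L2-normalized embeddings) and the function names are illustrative assumptions.

```python
# Minimal sketch of feature-only distillation (assumed setup, not the paper's implementation).
import torch
import torch.nn.functional as F


def feature_distillation_loss(student_feat, teacher_feat):
    """MSE between L2-normalized embeddings; a common feature-matching choice,
    the exact loss in the paper may differ."""
    s = F.normalize(student_feat, dim=1)
    t = F.normalize(teacher_feat, dim=1)
    return F.mse_loss(s, t)


def train_step(student, teacher, images, optimizer):
    """One hypothetical training step: `student` and `teacher` are face-embedding
    backbones. The teacher is frozen, and no class-center FC layer is allocated,
    which is what spares GPU memory as the number of identities grows."""
    teacher.eval()
    with torch.no_grad():
        t_feat = teacher(images)
    s_feat = student(images)
    loss = feature_distillation_loss(s_feat, t_feat)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```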

Related Material


BibTeX

@InProceedings{Li_2023_CVPR,
    author    = {Li, Jingzhi and Guo, Zidong and Li, Hui and Han, Seungju and Baek, Ji-won and Yang, Min and Yang, Ran and Suh, Sungjoo},
    title     = {Rethinking Feature-Based Knowledge Distillation for Face Recognition},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2023},
    pages     = {20156-20165}
}