Learning Discriminative Aggregation Network for Video-Based Face Recognition

Yongming Rao, Ji Lin, Jiwen Lu, Jie Zhou; The IEEE International Conference on Computer Vision (ICCV), 2017, pp. 3781-3790

Abstract


In this paper, we propose a discriminative aggregation network (DAN) for video face recognition, which aims to integrate information from video frames effectively and efficiently. Different from existing aggregation methods, our method aggregates raw video frames directly instead of the features obtained by complex processing. By combining the idea of metric learning and adversarial learning, we learn an aggregation network that produces more discriminative synthesized images compared to input frames. Our framework reduces the number of frames to be processed and greatly speed up the recognition procedure. Furthermore, low-quality frames containing misleading information are denoised during the aggregation process, making the system more robust and discriminative. Experimental results show that our framework can generate discriminative images from video clips and improve the overall recognition performance in both the speed and accuracy on three widely used datasets.

Related Material


[pdf] [video]
[bibtex]
@InProceedings{Rao_2017_ICCV,
author = {Rao, Yongming and Lin, Ji and Lu, Jiwen and Zhou, Jie},
title = {Learning Discriminative Aggregation Network for Video-Based Face Recognition},
booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
month = {Oct},
year = {2017}
}