Low Quality Video Face Recognition: Multi-Mode Aggregation Recurrent Network (MARN)

Sixue Gong, Yichun Shi, Anil Jain; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, pp. 0-0

Abstract


Face recognition performance deteriorates when face images are of very low quality. For low quality video sequences, however, more discriminative features can be obtained by aggregating the information in video frames. We propose a Multi-mode Aggregation Recurrent Network (MARN) for real-world low-quality video face recognition. Unlike existing recurrent networks (RNNs), MARN is robust against overfitting since it learns to aggregate pre-trained embeddings. Compared with quality-aware aggregation methods, MARN utilizes the video context and learns multiple attention vectors adaptively. Empirical results on three video face recognition datasets, IJB-S, YTF, and PaSC show that MARN significantly boosts the performance on the low quality video dataset while achieves comparable results on high quality video datasets.

Related Material


[pdf]
[bibtex]
@InProceedings{Gong_2019_ICCV,
author = {Gong, Sixue and Shi, Yichun and Jain, Anil},
title = {Low Quality Video Face Recognition: Multi-Mode Aggregation Recurrent Network (MARN)},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
month = {Oct},
year = {2019}
}