Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade

Erjin Zhou, Haoqiang Fan, Zhimin Cao, Yuning Jiang, Qi Yin; Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops, 2013, pp. 386-391

Abstract


We present a new approach to localize extensive facial landmarks with a coarse-to-fine convolutional network cascade. Deep convolutional neural networks (DCNN) have been successfully utilized in facial landmark localization for two-fold advantages: 1) geometric constraints among facial points are implicitly utilized; 2) huge amount of training data can be leveraged. However, in the task of extensive facial landmark localization, a large number of facial landmarks (more than 50 points) are required to be located in a unified system, which poses great difficulty in the structure design and training process of traditional convolutional networks. In this paper, we design a four-level convolutional network cascade, which tackles the problem in a coarse-to-fine manner. In our system, each network level is trained to locally refine a subset of facial landmarks generated by previous network levels. In addition, each level predicts explicit geometric constraints (the position and rotation angles of a specific facial component) to rectify the inputs of the current network level. The combination of coarse-to-fine cascade and geometric refinement enables our system to locate extensive facial landmarks (68 points) accurately in the 300-W facial landmark localization challenge.

Related Material


[pdf]
[bibtex]
@InProceedings{Zhou_2013_ICCV_Workshops,
author = {Erjin Zhou and Haoqiang Fan and Zhimin Cao and Yuning Jiang and Qi Yin},
title = {Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops},
month = {June},
year = {2013}
}