Learning-by-Synthesis for Appearance-based 3D Gaze Estimation

Yusuke Sugano, Yasuyuki Matsushita, Yoichi Sato; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014, pp. 1821-1828


Inferring human gaze from low-resolution eye images is still a challenging task despite its practical importance in many application scenarios. This paper presents a learning-by-synthesis approach to accurate image-based gaze estimation that is person- and head pose-independent. Unlike existing appearance-based methods that assume person-specific training data, we use a large amount of cross-subject training data to train a 3D gaze estimator. We collect the largest and fully calibrated multi-view gaze dataset and perform a 3D reconstruction in order to generate dense training data of eye images. By using the synthesized dataset to learn a random regression forest, we show that our method outperforms existing methods that use low-resolution eye images.

Related Material

author = {Sugano, Yusuke and Matsushita, Yasuyuki and Sato, Yoichi},
title = {Learning-by-Synthesis for Appearance-based 3D Gaze Estimation},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2014}