Semi-supervised Learning with Constraints for Person Identification in Multimedia Data

Martin Bauml, Makarand Tapaswi, Rainer Stiefelhagen; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013, pp. 3602-3609

Abstract


We address the problem of person identification in TV series. We propose a unified learning framework for multiclass classification which incorporates labeled and unlabeled data, and constraints between pairs of features in the training. We apply the framework to train multinomial logistic regression classifiers for multi-class face recognition. The method is completely automatic, as the labeled data is obtained by tagging speaking faces using subtitles and fan transcripts of the videos. We demonstrate our approach on six episodes each of two diverse TV series and achieve state-of-the-art performance.

Related Material


[pdf]
[bibtex]
@InProceedings{Bauml_2013_CVPR,
author = {Bauml, Martin and Tapaswi, Makarand and Stiefelhagen, Rainer},
title = {Semi-supervised Learning with Constraints for Person Identification in Multimedia Data},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2013}
}