Finding Actors and Actions in Movies

P. Bojanowski, F. Bach, I. Laptev, J. Ponce, C. Schmid, J. Sivic; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2013, pp. 2280-2287


We address the problem of learning a joint model of actors and actions in movies using weak supervision provided by scripts. Specifically, we extract actor/action pairs from the script and use them as constraints in a discriminative clustering framework. The corresponding optimization problem is formulated as a quadratic program under linear constraints. People in video are represented by automatically extracted and tracked faces together with corresponding motion features. First, we apply the proposed framework to the task of learning names of characters in the movie and demonstrate significant improvements over previous methods used for this task. Second, we explore the joint actor/action constraint and show its advantage for weakly supervised action learning. We validate our method in the challenging setting of localizing and recognizing characters and their actions in feature length movies Casablanca and American Beauty.

Related Material

author = {Bojanowski, P. and Bach, F. and Laptev, I. and Ponce, J. and Schmid, C. and Sivic, J.},
title = {Finding Actors and Actions in Movies},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {December},
year = {2013}