Discovering States and Transformations in Image Collections

Phillip Isola, Joseph J. Lim, Edward H. Adelson; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 1383-1391


Objects in visual scenes come in a rich variety of transformed states. A few classes of transformation have been heavily studied in computer vision: mostly simple, parametric changes in color and geometry. However, transformations in the physical world occur in many more flavors, and they come with semantic meaning: e.g., bending, folding, aging, etc. The transformations an object can undergo tell us about its physical and functional properties. In this paper, we introduce a dataset of objects, scenes, and materials, each of which is found in a variety of transformed states. Given a novel collection of images, we show how to explain the collection in terms of the states and transformations it depicts. Our system works by generalizing across object classes: states and transformations learned on one set of objects are used to interpret the image collection for an entirely new object class.

Related Material

author = {Isola, Phillip and Lim, Joseph J. and Adelson, Edward H.},
title = {Discovering States and Transformations in Image Collections},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2015}