Semantic Transform: Weakly Supervised Semantic Inference for Relating Visual Attributes

Sukrit Shankar, Joan Lasenby, Roberto Cipolla; The IEEE International Conference on Computer Vision (ICCV), 2013, pp. 361-368

Abstract


Relative (comparative) attributes are promising for thematic ranking of visual entities, which also aids in recognition tasks [19, 23]. However, attribute rank learning often requires a substantial amount of relational supervision, which is highly tedious, and apparently impractical for realworld applications. In this paper, we introduce the Semantic Transform, which under minimal supervision, adaptively finds a semantic feature space along with a class ordering that is related in the best possible way. Such a semantic space is found for every attribute category. To relate the classes under weak supervision, the class ordering needs to be refined according to a cost function in an iterative procedure. This problem is ideally NP-hard, and we thus propose a constrained search tree formulation for the same. Driven by the adaptive semantic feature space representation, our model achieves the best results to date for all of the tasks of relative, absolute and zero-shot classification on two popular datasets.

Related Material


[pdf]
[bibtex]
@InProceedings{Shankar_2013_ICCV,
author = {Shankar, Sukrit and Lasenby, Joan and Cipolla, Roberto},
title = {Semantic Transform: Weakly Supervised Semantic Inference for Relating Visual Attributes},
booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
month = {December},
year = {2013}
}