Dense Semantic Image Segmentation with Objects and Attributes

Shuai Zheng, Ming-Ming Cheng, Jonathan Warrell, Paul Sturgess, Vibhav Vineet, Carsten Rother, Philip H. S. Torr; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014, pp. 3214-3221


The concepts of objects and attributes are both important for describing images precisely, since verbal descriptions often contain both adjectives and nouns (e.g. "I see a shiny red chair'). In this paper, we formulate the problem of joint visual attribute and object class image segmentation as a dense multi-labelling problem, where each pixel in an image can be associated with both an object-class and a set of visual attributes labels. In order to learn the label correlations, we adopt a boosting-based piecewise training approach with respect to the visual appearance and co-occurrence cues. We use a filtering-based mean-field approximation approach for efficient joint inference. Further, we develop a hierarchical model to incorporate region-level object and attribute information. Experiments on the aPASCAL, CORE and attribute augmented NYU indoor scenes datasets show that the proposed approach is able to achieve state-of-the-art results.

Related Material

author = {Zheng, Shuai and Cheng, Ming-Ming and Warrell, Jonathan and Sturgess, Paul and Vineet, Vibhav and Rother, Carsten and Torr, Philip H. S.},
title = {Dense Semantic Image Segmentation with Objects and Attributes},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2014}