New Graph Structured Sparsity Model for Multi-label Image Annotations

Xiao Cai, Feiping Nie, Weidong Cai, Heng Huang; The IEEE International Conference on Computer Vision (ICCV), 2013, pp. 801-808


In multi-label image annotations, because each image is associated to multiple categories, the semantic terms (label classes) are not mutually exclusive. Previous research showed that such label correlations can largely boost the annotation accuracy. However, all existing methods only directly apply the label correlation matrix to enhance the label inference and assignment without further learning the structural information among classes. In this paper, we model the label correlations using the relational graph, and propose a novel graph structured sparse learning model to incorporate the topological constraints of relation graph in multi-label classifications. As a result, our new method will capture and utilize the hidden class structures in relational graph to improve the annotation results. In proposed objective, a large number of structured sparsity-inducing norms are utilized, thus the optimization becomes difficult. To solve this problem, we derive an efficient optimization algorithm with proved convergence. We perform extensive experiments on six multi-label image annotation benchmark data sets. In all empirical results, our new method shows better annotation results than the state-of-the-art approaches.

Related Material

author = {Cai, Xiao and Nie, Feiping and Cai, Weidong and Huang, Heng},
title = {New Graph Structured Sparsity Model for Multi-label Image Annotations},
booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
month = {December},
year = {2013}