Large-Scale Image Annotation by Efficient and Robust Kernel Metric Learning

Zheyun Feng, Rong Jin, Anil Jain; The IEEE International Conference on Computer Vision (ICCV), 2013, pp. 1609-1616


One of the key challenges in search-based image annotation models is to define an appropriate similarity measure between images. Many kernel distance metric learning (KML) algorithms have been developed in order to capture the nonlinear relationships between visual features and semantics of the images. One fundamental limitation in applying KML to image annotation is that it requires converting image annotations into binary constraints, leading to a significant information loss. In addition, most KML algorithms suffer from high computational cost due to the requirement that the learned matrix has to be positive semi-definitive (PSD). In this paper, we propose a robust kernel metric learning (RKML) algorithm based on the regression technique that is able to directly utilize image annotations. The proposed method is also computationally more efficient because PSD property is automatically ensured by regression. We provide the theoretical guarantee for the proposed algorithm, and verify its efficiency and effectiveness for image annotation by comparing it to state-of-the-art approaches for both distance metric learning and image annotation.

Related Material

author = {Feng, Zheyun and Jin, Rong and Jain, Anil},
title = {Large-Scale Image Annotation by Efficient and Robust Kernel Metric Learning},
booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
month = {December},
year = {2013}