Robust Object Recognition in RGB-D Egocentric Videos Based on Sparse Affine Hull Kernel

Wan, Shaohua; Aggarwal, J.K.

Shaohua Wan, J.K. Aggarwal; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2015, pp. 97-104

Abstract

In this paper, we propose a novel kernel function for recognizing objects in RGB-D egocentric videos. In order to effectively exploit the varied object appearance in a video, we take a set-based recognition approach and represent the target object using the set of frames contained in the video. Our kernel function measures the similarity of two sets by the minimum distance between the sparse affine hulls of the two sets. Our kernel function also allows convenient integration of heterogeneous data modalities beyond RGB and depth. We extensively evaluate the proposed method on three benchmark datasets, including two RGB-D object datasets and one thermal/visible face dataset. All the results clearly show that the proposed method outperforms state-of-the-art methods.

Related Material

[pdf]

[bibtex]

@InProceedings{Wan_2015_CVPR_Workshops,
author = {Wan, Shaohua and Aggarwal, J.K.},
title = {Robust Object Recognition in RGB-D Egocentric Videos Based on Sparse Affine Hull Kernel},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
month = {June},
year = {2015}
}