Real-Time Monocular Pose Estimation of 3D Objects Using Temporally Consistent Local Color Histograms

Henning Tjaden, Ulrich Schwanecke, Elmar Schomer; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 124-132

Abstract


We present a novel approach to 6DOF pose estimation and segmentation of rigid 3D objects using a single monocular RGB camera, based on temporally consistent local color histograms. We show that this approach outperforms previous methods in cases of cluttered backgrounds, heterogeneous objects, and occlusions. The proposed histograms can be used as statistical object descriptors within a template matching strategy for pose recovery after temporary tracking loss, e.g., caused by massive occlusion or by the object leaving the camera's field of view. The descriptors can be trained online within a couple of seconds by moving a handheld object in front of a camera. Even during the training stage, our approach is capable of recovering from accidental tracking loss. We demonstrate the performance of our method in comparison to the state of the art in several challenging experiments, including one on a popular public data set.
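
To make the idea of a temporally consistent color histogram more concrete, the following is a minimal sketch and not the authors' implementation. It assumes that temporal consistency is realized as an exponential moving average over per-frame histograms, and it uses a single region for brevity where the abstract speaks of local histograms. The function names, the bin count, the learning rate, and the prior are all hypothetical choices made for illustration.

```python
# Illustrative sketch only: every name and parameter value here is an assumption.

def bin_index(pixel, bins_per_channel=8):
    """Map an (r, g, b) pixel with values in 0..255 to a flat histogram bin.

    Assumes bins_per_channel divides 256 evenly.
    """
    width = 256 // bins_per_channel
    r, g, b = pixel
    return ((r // width) * bins_per_channel + g // width) * bins_per_channel + b // width


def color_histogram(pixels, bins_per_channel=8):
    """Build a normalized RGB histogram from an iterable of (r, g, b) pixels."""
    hist = [0.0] * bins_per_channel ** 3
    for pixel in pixels:
        hist[bin_index(pixel, bins_per_channel)] += 1.0
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def blend(previous, current, learning_rate=0.1):
    """Blend the previous histogram with the current frame's histogram.

    A small learning rate keeps the model stable over time; a large one
    adapts quickly to appearance changes.
    """
    return [(1.0 - learning_rate) * p + learning_rate * c
            for p, c in zip(previous, current)]


def foreground_posterior(pixel, fg_hist, bg_hist, bins_per_channel=8, prior_fg=0.5):
    """Bayes-rule probability that a pixel belongs to the foreground."""
    idx = bin_index(pixel, bins_per_channel)
    fg = prior_fg * fg_hist[idx]
    bg = (1.0 - prior_fg) * bg_hist[idx]
    # Fall back to the prior when neither model has seen this color.
    return fg / (fg + bg) if fg + bg > 0.0 else prior_fg
```

In such a scheme, `blend` would be called once per frame for both the foreground and the background model, and `foreground_posterior` would supply the per-pixel membership used for segmentation.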

Related Material


[bibtex]
@InProceedings{Tjaden_2017_ICCV,
author = {Tjaden, Henning and Schwanecke, Ulrich and Schomer, Elmar},
title = {Real-Time Monocular Pose Estimation of 3D Objects Using Temporally Consistent Local Color Histograms},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {Oct},
year = {2017},
pages = {124--132}
}