End-to-end 6-DoF Object Pose Estimation through Differentiable Rasterization

Andrea Palazzi, Luca Bergamini, Simone Calderara, Rita Cucchiara; Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2018, pp. 0-0

Abstract


Here we introduce an approximated differentiable renderer to refine a 6-DoF pose prediction using only 2D alignment information. To this end, a two-branched convolutional encoder network is employed to jointly estimate the object class and its 6-DoF pose in the scene. We then propose a new formulation of an approximated differentiable renderer to re-project the 3D object on the image according to its predicted pose; in this way the alignment error between the observed and the re-projected object silhouette can be measured. Since the renderer is differentiable, it is possible to back-propagate through it to correct the estimated pose at test time in an online learning fashion. Eventually we show how to leverage the classification branch to profitably re-project a representative model of the predicted class (i.e. a medoid) instead. Each object in the scene is processed independently and novel viewpoints in which both objects arrangement and mutual pose are preserved can be rendered.

Related Material


[pdf]
[bibtex]
@InProceedings{Palazzi_2018_ECCV_Workshops,
author = {Palazzi, Andrea and Bergamini, Luca and Calderara, Simone and Cucchiara, Rita},
title = {End-to-end 6-DoF Object Pose Estimation through Differentiable Rasterization},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV) Workshops},
month = {September},
year = {2018}
}