A Structure-Aware Method for Direct Pose Estimation

Hunter Blanton, Scott Workman, Nathan Jacobs; Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, pp. 2019-2028

Abstract


Estimating camera pose from a single image is a fundamental problem in computer vision. Existing methods for solving this task fall into two distinct categories, which we refer to as direct and indirect. Direct methods, such as PoseNet, regress pose from the image as a fixed function, for example using a feed-forward convolutional network. Such methods are desirable because they are deterministic and run in constant time. Indirect methods for pose regression are often non-deterministic, with various external dependencies such as image retrieval and hypothesis sampling. We propose a direct method that takes inspiration from structure-based approaches to incorporate explicit 3D constraints into the network. Our approach maintains the desirable qualities of other direct methods while achieving much lower error in general. Code is available at https://github.com/mvrl/structure-aware-pose-estimation.

Related Material


[bibtex]
@InProceedings{Blanton_2022_WACV,
    author    = {Blanton, Hunter and Workman, Scott and Jacobs, Nathan},
    title     = {A Structure-Aware Method for Direct Pose Estimation},
    booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
    month     = {January},
    year      = {2022},
    pages     = {2019-2028}
}