Scene Intrinsics and Depth From a Single Image

Evan Shelhamer, Jonathan T. Barron, Trevor Darrell; Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops, 2015, pp. 37-44


Intrinsic image decomposition factorizes an observed image into its physical causes. This is most commonly framed as a decomposition into reflectance and shading, although recent progress has made full decompositions into shape, illumination, reflectance, and shading possible. However, existing factorization approaches require depth sensing to initialize the optimization of scene intrinsics. Rather than relying on depth sensors, we show that depth estimated purely from monocular appearance can provide sufficient cues for intrinsic image analysis. Our full intrinsic pipeline regresses depth by a fully convolutional network then jointly optimizes the intrinsic factorization to recover the input image. This combination yields full decompositions by uniting feature learning through deep network regression with physical modeling through statistical priors and random field regularization. This work demonstrates the first pipeline for full intrinsic decomposition of scenes from a single color image input alone.

Related Material

author = {Shelhamer, Evan and Barron, Jonathan T. and Darrell, Trevor},
title = {Scene Intrinsics and Depth From a Single Image},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops},
month = {December},
year = {2015}