Intrinsic Image Transformation via Scale Space Decomposition

Lechao Cheng, Chengyi Zhang, Zicheng Liao; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 656-665


We introduce a new network structure for decomposing an image into its intrinsic albedo and shading. We treat this as an image-to-image transformation problem and explore the scale space of the input and output. By expanding the output images (albedo and shading) into their Laplacian pyramid components, we develop a multi-channel network structure that learns the image-to-image transformation function in successive frequency bands in parallel, within each channel is a fully convolutional neural network with skip connections. This network structure is general and extensible, and has demonstrated excellent performance on the intrinsic image decomposition problem. We evaluate the network on two benchmark datasets: the MPI-Sintel dataset and the MIT Intrinsic Images dataset. Both quantitative and qualitative results show our model delivers a clear progression over state-of-the-art.

Related Material

[pdf] [arXiv]
author = {Cheng, Lechao and Zhang, Chengyi and Liao, Zicheng},
title = {Intrinsic Image Transformation via Scale Space Decomposition},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2018}