Characterizing and Improving Stability in Neural Style Transfer

Agrim Gupta, Justin Johnson, Alexandre Alahi, Li Fei-Fei; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 4067-4076

Abstract


Recent progress in style transfer on images has focused on improving the quality of stylized images and speed of methods. However, real-time methods are highly unstable resulting in visible flickering when applied to videos. In this work we characterize the instability of these methods by examining the solution set of the style transfer objective. We show that the trace of the Gram matrix representing style is inversely related to the stability of the method. Then, we present a recurrent convolutional network for real-time video style transfer which incorporates a temporal consistency loss and overcomes the instability of prior methods. Our networks can be applied at any resolution, do not require optical flow at test time, and produce high quality, temporally consistent stylized videos in real-time.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Gupta_2017_ICCV,
author = {Gupta, Agrim and Johnson, Justin and Alahi, Alexandre and Fei-Fei, Li},
title = {Characterizing and Improving Stability in Neural Style Transfer},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {Oct},
year = {2017}
}