Image Compression With Recurrent Neural Network and Generalized Divisive Normalization

Khawar Islam, L. Minh Dang, Sujin Lee, Hyeonjoon Moon; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2021, pp. 1875-1879

Abstract


Image compression is a method to remove spatial redundancy between adjacent pixels and reconstruct a high-quality image. In the past few years, deep learning has gained huge attention from the research community and produced promising image reconstruction results. Therefore, recent methods focused on developing deeper and more complex networks, which significantly increased network complexity. In this paper, two effective novel blocks are developed: analysis and synthesis block that employs the convolution layer and Generalized Divisive Normalization (GDN) in the variable-rate encoder and decoder side. Our network utilizes a pixel RNN approach for quantization. Furthermore, to improve the whole network, we encode a residual image using LSTM cells to reduce unnecessary information. Experimental results demonstrated that the proposed variable-rate framework with novel blocks outperforms existing methods and standard image codecs, such as George's and JPEG in terms of image similarity.

Related Material


[pdf]
[bibtex]
@InProceedings{Islam_2021_CVPR, author = {Islam, Khawar and Dang, L. Minh and Lee, Sujin and Moon, Hyeonjoon}, title = {Image Compression With Recurrent Neural Network and Generalized Divisive Normalization}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2021}, pages = {1875-1879} }