Improve Image Codec's Performance by Variating Post Enhancing Neural Network: Submission of zxw for CLIC2020

Li, Ming; Zhang, Yundong; Xia, Changsheng; Zan, Jinwen; Huang, Zhangming; Chen, Dekai; Li, Guoxin; Nie, Jing

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2020, pp. 114-115

Abstract

Adding a post-enhancement filter after a traditional image decoder to improve reconstruction quality is now a very common method [??],[??],[??],[??]. Researchers either use one large network filter or repeatedly stack one or more types of relatively simple filters, and both approaches achieve better results. However, the amount of training data and the training time increase exponentially with network scale, while the performance gain diminishes. We learned this from the CLIC2019 low-rate track, where we proposed VimicroABCnet and VimicroSpeed [??], which use post filters of two different scales. The larger one (5 times the size of the smaller one) improved the final test PSNR by only 0.02 dB at 0.15 bpp. In this paper, we propose a method to vary an existing post network filter (the base filter). The base filter is altered into several different filters; only the weights are altered. The key to the method is dividing the training data into groups. Starting from the pre-trained base filter, each altered filter is fine-tuned individually on a different group of training data. There are different ways to divide the training data, and we use a relatively simple one: sort the training images by compression rate (with the traditional codec) and bin them into 4 or 8 training subsets. With the new filters plus the base one, we have 5 or 9 candidate filters in the encoding phase and choose the best one. On the CLIC2019 test data, PSNR increases by 0.04 dB and 0.06 dB at 0.15 bpp, respectively, over the single-filter VimicroSpeed method. This method requires the same training data and is well suited to multi-GPU training, and retraining the altered filters is much easier and less time-consuming than training a relatively large network filter. The result is also better (5-filter scheme: 0.04 dB vs. VimicroABCnet: 0.02 dB).
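The two data-dependent steps described in the abstract, binning the training images by compression rate and picking the best candidate filter at encoding time, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names (`split_by_rate`, `psnr`, `choose_filter`), the equal-count binning rule, the use of PSNR as the selection criterion, and the representation of an image as a flat list of pixel values are all assumptions made for illustration. Filters are modelled as plain callables standing in for the fine-tuned networks.

```python
import math
from typing import Callable, List, Sequence

Image = List[float]  # hypothetical stand-in: flattened pixel values in [0, 255]
Filter = Callable[[Image], Image]  # stand-in for a post-enhancement network


def split_by_rate(rates: Sequence[float], num_groups: int) -> List[List[int]]:
    """Sort image indices by compression rate (bpp) and cut them into
    num_groups contiguous bins of near-equal size (assumed binning rule)."""
    order = sorted(range(len(rates)), key=lambda i: rates[i])
    n = len(order)
    return [order[k * n // num_groups:(k + 1) * n // num_groups]
            for k in range(num_groups)]


def psnr(reference: Image, candidate: Image, peak: float = 255.0) -> float:
    """Peak signal-to-noise ratio in dB between two equally sized images."""
    mse = sum((r - c) ** 2 for r, c in zip(reference, candidate)) / len(reference)
    return math.inf if mse == 0 else 10.0 * math.log10(peak ** 2 / mse)


def choose_filter(original: Image, decoded: Image,
                  filters: Sequence[Filter]) -> int:
    """Encoder-side selection: run every candidate filter on the decoded
    image and return the index of the one with the highest PSNR."""
    scores = [psnr(original, f(decoded)) for f in filters]
    return max(range(len(filters)), key=lambda i: scores[i])


if __name__ == "__main__":
    # Eight training images with made-up rates, binned into 4 groups.
    rates = [0.30, 0.10, 0.20, 0.40, 0.15, 0.35, 0.25, 0.05]
    print(split_by_rate(rates, 4))

    # Toy candidates: the base filter (identity here) plus two altered ones.
    candidates = [lambda x: list(x),
                  lambda x: [v + 2 for v in x],
                  lambda x: [v - 2 for v in x]]
    print(choose_filter([100.0] * 4, [98.0] * 4, candidates))
```

In a real pipeline each group returned by `split_by_rate` would drive one fine-tuning run that starts from the base filter's weights. Because the runs are independent of each other, they can be distributed across GPUs.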

Related Material


@InProceedings{Li_2020_CVPR_Workshops,
author = {Li, Ming and Zhang, Yundong and Xia, Changsheng and Zan, Jinwen and Huang, Zhangming and Chen, Dekai and Li, Guoxin and Nie, Jing},
title = {Improve Image Codec's Performance by Variating Post Enhancing Neural Network: Submission of zxw for CLIC2020},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
month = {June},
year = {2020}
}