Image Quality Assessment With Transformers and Multi-Metric Fusion Modules

Wei Jiang, Litian Li, Yi Ma, Yongqi Zhai, Zheng Yang, Ronggang Wang; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022, pp. 1805-1809

Abstract


Image quality assessment is crucial for low-level vision tasks such as compression, super-resolution, denoising and etc. It guides researchers how to design networks, design loss functions, and decide the optimization direction of networks. A good quality assessment metric should comform to people's subjective feelings as much as possible. Traditional PSNR and MS-SSIM have more and more obvious shortcomings in quality evaluation With the popularity of GANs. Inspired by metrics such as LPIPS, IQT, etc., we decided to design a metric that is learned by the network itself. In this paper, we use a ConvNeXt-Tiny network to extract features and calculate nonlinear residuals between reference images and distorted images. We feed residuals into a transformer to compare the degree of distortion. In addition, we use multi-metric fusion to improve the performance of our network. Our model achieves 0.780 accuracy on CLIC validation set.

Related Material


[pdf]
[bibtex]
@InProceedings{Jiang_2022_CVPR, author = {Jiang, Wei and Li, Litian and Ma, Yi and Zhai, Yongqi and Yang, Zheng and Wang, Ronggang}, title = {Image Quality Assessment With Transformers and Multi-Metric Fusion Modules}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2022}, pages = {1805-1809} }