RBSFormer: Enhanced Transformer Network for Raw Image Super-Resolution

Siyuan Jiang, Senyan Xu, Xingfu Wang; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 6479-6488

Abstract


In smartphones and mobile camera devices the Image Signal Processor(ISP) is applied to reconstruct the RAW image into a sRGB image for human reading by a series of signal modules. Due to the non-linear ISP transformation it is complicated to model the degradation in the sRGB domain. Most existing super-resolution methods directly handle the sRGB image processed by the ISP introducing more difficult degradation patterns. To address this challenge we propose an enhanced transformer network named RBSFormer. Unlike other methods that operate on sRGB images RBSFormer takes RAW images as input thus avoiding the complex degradation introduced by ISP processing. We design two enhanced core components i.e. Enhanced Cross-Covairance Attention(EXCA) and Enhanced Gated Feed-forward Network(EGFN) in the RBSFormer and we further introduce data augmentation in the RAW domain and hybrid ensemble strategies to enhance our results. Experimental results demonstrate superior performance against the majority of methods both qualitatively and quantitatively. Our RBSFormer achieves 3rd place in terms of all the evaluation metrics both on the official validation and testing set with fewer parameters in the NTIRE 2024 challenge on Raw Image Super Resolution.

Related Material


[pdf]
[bibtex]
@InProceedings{Jiang_2024_CVPR, author = {Jiang, Siyuan and Xu, Senyan and Wang, Xingfu}, title = {RBSFormer: Enhanced Transformer Network for Raw Image Super-Resolution}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2024}, pages = {6479-6488} }