Reference-Based Face Super-Resolution Using the Spatial Transformer

Varun Ramesh Jois, Antonella DiLillo, James Storer; Proceedings of the Asian Conference on Computer Vision (ACCV), 2024, pp. 3689-3705

Abstract


Face super-resolution is the task of increasing the resolution of an image containing a face thereby adding finer detail. It is a ubiquitous task in many computer vision applications and quite often the user isnt even aware that it is being performed. However, doing it with high fidelity is challenging as it is an ill-posed problem. In this paper we present a reference-based solution for face super-resolution that uses higher resolution reference images to aid in the task. We show an alignment module based on the spatial transformer that is considerably more stable than the popular deformable convolutions. We also show an aggregation function that can take good quality information from the reference images when available or suppress the function when such information is unavailable. Finally, we show that our relatively smaller model can achieve state of the art results on multiple datasets. The source code is available at https://github.com/varun-jois/FSRST.

Related Material


[pdf]
[bibtex]
@InProceedings{Jois_2024_ACCV, author = {Jois, Varun Ramesh and DiLillo, Antonella and Storer, James}, title = {Reference-Based Face Super-Resolution Using the Spatial Transformer}, booktitle = {Proceedings of the Asian Conference on Computer Vision (ACCV)}, month = {December}, year = {2024}, pages = {3689-3705} }