CoMoGAN: Continuous Model-Guided Image-to-Image Translation

Fabio Pizzati, Pietro Cerri, Raoul de Charette; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 14288-14298

Abstract


CoMoGAN is a continuous GAN relying on the unsupervised reorganization of the target data on a functional manifold. To that matter, we introduce a new Functional Instance Normalization layer and residual mechanism, which together disentangle image content from position on target manifold. We rely on naive physics-inspired models to guide the training while allowing private model/translations features. CoMoGAN can be used with any GAN backbone and allows new types of image translation, such as cyclic image translation like timelapse generation, or detached linear translation. On all datasets, it outperforms the literature. Our code is available in this page: https://github.com/cv-rits/CoMoGAN.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Pizzati_2021_CVPR, author = {Pizzati, Fabio and Cerri, Pietro and de Charette, Raoul}, title = {CoMoGAN: Continuous Model-Guided Image-to-Image Translation}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2021}, pages = {14288-14298} }