Scene Designer: A Unified Model for Scene Search and Synthesis From Sketch

Leo Sampaio Ferraz Ribeiro, Tu Bui, John Collomosse, Moacir Ponti; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2021, pp. 2424-2433

Abstract


Scene Designer is a novel method for searching and generating images using free-hand sketches of scene compositions; i.e. drawings that describe both the appearance and relative positions of objects. Our core contribution is a single unified model to learn both a cross-modal search embedding for matching sketched compositions to images, and an object embedding for layout synthesis. We show that a graph neural network (GNN) followed by Transformer under our novel contrastive learning setting is required to allow learning correlations between object type, appearance and arrangement, driving a mask generation module that synthesises coherent scene layouts, whilst also delivering state of the art sketch based visual search of scenes.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Ribeiro_2021_ICCV, author = {Ribeiro, Leo Sampaio Ferraz and Bui, Tu and Collomosse, John and Ponti, Moacir}, title = {Scene Designer: A Unified Model for Scene Search and Synthesis From Sketch}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops}, month = {October}, year = {2021}, pages = {2424-2433} }