Universal Cross-Domain Retrieval: Generalizing Across Classes and Domains

Soumava Paul, Titir Dutta, Soma Biswas; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 12056-12064

Abstract


In this work, for the first time, we address the problem of universal cross-domain retrieval, where the test data can belong to classes or domains which are unseen during training. Due to dynamically increasing number of categories and practical constraint of training on every possible domain, which requires large amounts of data, generalizing to both unseen classes and domains is important. Towards that goal, we propose SnMpNet (Semantic Neighbourhood and Mixture Prediction Network), which incorporates two novel losses to account for the unseen classes and domains encountered during testing. Specifically, we introduce a novel Semantic Neighborhood loss to bridge the knowledge gap between seen and unseen classes and ensure that the latent space embedding of the unseen classes is semantically meaningful with respect to its neighboring classes. We also introduce a mix-up based supervision at image-level as well as semantic-level of the data for training with the Mixture Prediction loss, which helps in efficient retrieval when the query belongs to an unseen domain. These losses are incorporated on the SE-ResNet50 backbone to obtain SnMpNet. Extensive experiments on two large-scale datasets, Sketchy Extended and DomainNet, and thorough comparisons with state-of-the-art justify the effectiveness of the proposed model.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Paul_2021_ICCV, author = {Paul, Soumava and Dutta, Titir and Biswas, Soma}, title = {Universal Cross-Domain Retrieval: Generalizing Across Classes and Domains}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2021}, pages = {12056-12064} }