Focusing Visual Relation Detection on Relevant Relations with Prior Potentials

Francois PLESSE, Alexandru Ginsca, Bertrand DELEZOIDE, Francoise PRETEUX; Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2020, pp. 2980-2989

Abstract


Understanding images relies on the understanding of how visible objects are linked to each other. Current approaches of Visual Relation Detection (VRD) are hindered by the high frequency of some relations: when an important focus is put on them, more meaningful ones are overlooked. We address this challenge by learning the relative relevance of relations, and integrating this term into a novel scene graph extraction scheme. We show that this allows our model to predict relations on fewer and more relevant object pairs. It outperforms MotifNet, a state of the art model, on the Visual Genome dataset. It increases the Class Macro recall, the metric we propose to use, from 38.1% to 44.4%. In addition, we propose a new split of Visual Genome, with a more balanced relation distribution, emphasizing on the detection of uncommon relations and validates the use of the previous metric. On this set, our model outperforms MotifNet on all metrics, e.g. from 39.6% to 44.0% at 10 predictions per image on the relation classification task.

Related Material


[pdf] [video]
[bibtex]
@InProceedings{PLESSE_2020_WACV,
author = {PLESSE, Francois and Ginsca, Alexandru and DELEZOIDE, Bertrand and PRETEUX, Francoise},
title = {Focusing Visual Relation Detection on Relevant Relations with Prior Potentials},
booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
month = {March},
year = {2020}
}