Reconsidering Representation Alignment for Multi-View Clustering

Daniel J. Trosten, Sigurd Lokse, Robert Jenssen, Michael Kampffmeyer; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 1255-1265

Abstract


Aligning distributions of view representations is a core component of today's state-of-the-art models for deep multi-view clustering. However, we identify several drawbacks with naively aligning representation distributions. We demonstrate that these drawbacks both lead to less separable clusters in the representation space and inhibit the model's ability to prioritize views. Based on these observations, we develop a simple baseline model for deep multi-view clustering. Our baseline model avoids representation alignment altogether, while performing similarly to, or better than, the current state of the art. We also expand our baseline model by adding a contrastive learning component. This introduces a selective alignment procedure that preserves the model's ability to prioritize views. Our experiments show that the contrastive learning component enhances the baseline model, improving on the current state of the art by a large margin on several datasets.

Related Material


@InProceedings{Trosten_2021_CVPR,
    author    = {Trosten, Daniel J. and Lokse, Sigurd and Jenssen, Robert and Kampffmeyer, Michael},
    title     = {Reconsidering Representation Alignment for Multi-View Clustering},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {1255-1265}
}