Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Vivien Cabannes, Leon Bottou, Yann Lecun, Randall Balestriero; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 16274-16283

Abstract


Self-Supervised Learning (SSL) has emerged as the solution of choice to learn transferable representations from unlabeled data. However, SSL requires to build samples that are known to be semantically akin, i.e. positive views. Requiring such knowledge is the main limitation of SSL and is often tackled by ad-hoc strategies e.g. applying known data-augmentations to the same input. In this work, we generalize and formalize this principle through Positive Active Learning (PAL) where an oracle queries semantic relationships between samples. PAL achieves three main objectives. First, it is a theoretically grounded learning framework that encapsulates standard SSL but also supervised and semi-supervised learning depending on the employed oracle. Second, it provides a consistent algorithm to embed a priori knowledge, e.g. some observed labels, into any SSL losses without any change in the training pipeline. Third, it provides a proper active learning framework yielding low-cost solutions to annotate datasets, arguably bringing the gap between theory and practice of active learning that is based on simple-to-answer-by-non-experts queries of semantic relationships between inputs.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Cabannes_2023_ICCV, author = {Cabannes, Vivien and Bottou, Leon and Lecun, Yann and Balestriero, Randall}, title = {Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2023}, pages = {16274-16283} }