On Compositions of Transformations in Contrastive Self-Supervised Learning

doi:https://doi.org/10.1109/ICCV48922.2021.00944

On Compositions of Transformations in Contrastive Self-Supervised Learning

Authors	M. Patrick Y.M. Asano P. Kuznetsova R. Fong J.F. Henriques G. Zweig A. Vedaldi
Publication date	2021
Book title	2021 IEEE/CVF International Conference on Computer Vision
Book subtitle	proceedings : ICCV 2021 : 11-17 October 2021, virtual event
ISBN	9781665428132
ISBN (electronic)	9781665428125
Series	International Conference on Computer Vision
Event	2021 IEEE/CVF International Conference on Computer Vision
Pages (from-to)	9557-9567
Publisher	Los Alamitos, California: IEEE Computer Society
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	In the image domain, excellent representations can be learned by inducing invariance to content-preserving transformations via noise contrastive learning. In this paper, we generalize contrastive learning to a wider set of transformations, and their compositions, for which either invariance or distinctiveness is sought. We show that it is not immediately obvious how existing methods such as SimCLR can be extended to do so. Instead, we introduce a number of formal requirements that all contrastive formulations must satisfy, and propose a practical construction which satisfies these requirements. In order to maximise the reach of this analysis, we express all components of noise contrastive formulations as the choice of certain generalized transformations of the data (GDTs), including data sampling. We then consider videos as an example of data in which a large variety of transformations are applicable, accounting for the extra modalities – for which we analyze audio and text – and the dimension of time. We find that being invariant to certain transformations and distinctive to others is critical to learning effective video representations, improving the state-of-the-art for multiple benchmarks by a large margin, and even surpassing supervised pretraining. Code and pretrained models are available.
Document type	Conference contribution
Note	With supplemental items
Language	English
Published at	https://doi.org/10.1109/ICCV48922.2021.00944
Published at	https://openaccess.thecvf.com/content/ICCV2021/html/Patrick_On_Compositions_of_Transformations_in_Contrastive_Self-Supervised_Learning_ICCV_2021_paper.html
Other links	https://github.com/facebookresearch/GDT https://www.proceedings.com/61354.html
Downloads	Patrick_On_Compositions_of_Transformations_in_Contrastive_Self-Supervised_Learning_ICCV_2021_paper (Accepted author manuscript)
Supplementary materials	Patrick_On_Compositions_of_ICCV_2021_supplemental
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

On Compositions of Transformations in Contrastive Self-Supervised Learning