Abstract

Self-supervised models have been shown to produce comparable or better visual representations than their supervised counterparts when trained offline on unlabeled data at scale. However, their efficacy is catastrophically reduced in a Continual Learning (CL) scenario where data is presented to the model sequentially. In this paper, we show that self-supervised loss functions can be seamlessly converted into distillation mechanisms for CL by adding a predictor network that maps the current state of the representations to their past state. This enables us to devise a framework for Continual self-supervised visual representation Learning that (i) significantly improves the quality of the learned representations, (ii) is compatible with several state-of-the-art self-supervised objectives, and (iii) needs little to no hyperparameter tuning. We demonstrate the effectiveness of our approach empirically by training six popular self-supervised models in various CL settings. Code: github.com/DonkeyShot21/cassle.
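To make the core idea concrete, the sketch below shows one plausible reading of the abstract's mechanism: a frozen snapshot of the encoder from the previous task provides the "past" representations, a small predictor maps current representations onto them, and a self-supervised objective (here a SimSiam-style negative cosine similarity) is reused as the distillation loss. This is a minimal illustration, not the authors' implementation (see the linked repository for that); names such as `DistillWrapper`, `FEATURE_DIM`, and the predictor architecture are assumptions made for the example.

```python
import copy
import torch
import torch.nn as nn
import torch.nn.functional as F

# Assumed feature dimensionality of the backbone output (illustrative only).
FEATURE_DIM = 2048


def negative_cosine(p: torch.Tensor, z: torch.Tensor) -> torch.Tensor:
    """A simple self-supervised objective (negative cosine similarity,
    as in SimSiam/BYOL), reused here as the distillation loss."""
    return -F.cosine_similarity(p, z.detach(), dim=-1).mean()


class DistillWrapper(nn.Module):
    """Pairs the current encoder with a frozen copy of its past state and
    a predictor that maps current representations to their past state."""

    def __init__(self, encoder: nn.Module, hidden_dim: int = 512):
        super().__init__()
        self.encoder = encoder  # continues training on the new task
        # Frozen snapshot of the encoder taken at the end of the previous task.
        self.frozen_encoder = copy.deepcopy(encoder)
        for p in self.frozen_encoder.parameters():
            p.requires_grad = False
        # Predictor network: current features -> past features.
        self.predictor = nn.Sequential(
            nn.Linear(FEATURE_DIM, hidden_dim),
            nn.BatchNorm1d(hidden_dim),
            nn.ReLU(inplace=True),
            nn.Linear(hidden_dim, FEATURE_DIM),
        )

    def distillation_loss(self, x: torch.Tensor) -> torch.Tensor:
        z_new = self.encoder(x)            # current representation
        with torch.no_grad():
            z_old = self.frozen_encoder(x)  # past representation (frozen)
        p = self.predictor(z_new)           # predicted past state
        return negative_cosine(p, z_old)
```

In training, this distillation term would be added to the regular self-supervised loss computed on the new task's data, so that the same loss family serves both roles; how the two terms are combined in practice is a design choice of the full framework described in the paper.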
