ABSTRACT Classification with a few samples of training set has been a long-standing issue in the field of polarimetric synthetic aperture radar (PolSAR) image analysis and processing. In fact, one of the most important challenges of the PolSAR image classification task is the number of labelled samples. In essence, training network by utilizing just few numbers of training samples cannot be led to the reliable result since the neural network is so sensitive to the number of training samples. In addition, in the real PolSAR image classification task, the huge number of training samples is not accessible. On the other hand, classification performance using just small number of labelled samples is not an accurate result. So, in this paper, aiming at the small number of training samples of the PolSAR image classification task, a novel ensemble self-supervised feature-learning (ESSFL) model is designed. The designed ESSFL can automatically extract PolSAR features conducive to PolSAR image classification with a small number of training samples. In addition, it can significantly decrease the dependence of neural network algorithms on large labelled samples of training set. First, to utilize the spatial–polarimetric features of PolSAR data perfectly, the EfficientNet-B0 is presented and utilized as the main section of the deep learning (DL) model to extract DL features of PolSAR data. Then, using an optimization function that constrains the cross-correlation matrix of various distortions of each sample to the identity matrix, the designed deep learning model can obtain the effective features of homogeneous samples gathering together and heterogeneous samples separating from each other in a self-supervised manner. Moreover, two ensemble learning (EL) models, feature-level and view-level ensemble, are proposed to increase the feature extraction capability and classification result by jointly using spatial features at different scales and polarimetric information at different bands. Finally, the stack of the obtained features and the main polarimetric information of PolSAR data can be input into classifier for the classification of PolSAR data. In this paper we use two different classifiers, the first one is the Random Forest (RF) classifier and the second one is the Support Vector Machine (SVM) classifier. The advantages of SVM include that it can be used to avoid the difficulties of using linear functions in the high-dimensional feature space, and the optimization problem is transformed into dual convex quadratic programs that are so usable for an actual image classification task. In addition, among all the available classification methods, random forests provide one of the highest accuracies. The random forest technique can also handle big data with numerous variables running into thousands. In fact, it can automatically balance data sets when a class is more infrequent than other classes in the data that is so important for a real PolSAR image classification task. Experimental results on three well-known PolSAR data sets illustrate that the designed ESSFL can extract more discriminant features using the designed deep learning model. In the end, the experiments prove that the designed ESSFL has a significant classification performance compared with different deep learning models in the case of small number of training samples, and also it can achieve a better result in the case of large number of training samples by comparison with the most of the deep learning models.