Welding defect prediction is the foundation for ensuring welding quality in gas tungsten arc welding (GTAW). In the prediction process, method based on molten pool vision is the most effective. Since the classification of molten pool defects relies on a substantial volume of labeled data, it is challenging for the models to be applied industrially. This paper presents an algorithm, FS-Classifier, that can achieve high prediction accuracy based on a limited amount of labeled data. The FS-Classifier comprises two stages: Firstly, an unsupervised training approach named RaP is designed to pre-train the feature extractor using extensive unlabeled daily datasets. The RaP consists of a rotation angle prediction task and a position prediction task, which ensure that the network focuses on salient features and precise elements, respectively. Secondly, the support vectors constructed from limited labeled data are used for the feature classifier. The input data is classified to certain class by computing its distances to support vector. The model achieves an accuracy of 94.5 % on the private dataset and 92.8 % on the public dataset for the six classes of defects using 5 % of labeled data volume. In addition, comparative experiments show that our method only requires 5 % of labeled data to achieve accuracy comparable to traditional supervised learning methods. The proposed algorithm addresses the issue of relying on a substantial amount of labeled data in welding process defect classification.