Cross-domain self-supervised contrastive learning with multi-scale feature fusion for bearing fault diagnosis under limited labels

Abstract

Deep learning is widely used in fault diagnosis, but practical deployments often have little labeled data. This paper proposes a cross-domain self-supervised contrastive learning method with multi-scale feature fusion (CD-MSSCL) for bearing fault diagnosis with limited labeled samples. The method first applies a specialized time-domain data augmentation strategy designed to reflect the complexity of industrial vibration signals and improve model generalization. A purpose-built encoder backbone (MSF-SEA) combines multi-scale features with a SENet attention mechanism through pyramid fusion. The network is pre-trained in a self-supervised manner on unlabeled samples to capture the multi-frequency fault features that are critical for bearing fault diagnosis. A small number of labeled samples are then used to fine-tune the model, transferring the pre-trained features to the specific diagnosis task. Experimental results show that CD-MSSCL outperforms traditional deep learning and current contrastive learning methods in accuracy and domain adaptation under limited labels. By extracting and transferring knowledge from unlabeled data, the approach significantly reduces data collection and labeling costs.
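The pre-training stage described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the augmentations, the loss, or the internals of MSF-SEA, so everything below is an assumption: augment uses amplitude scaling plus Gaussian noise, toy_encoder is a hypothetical stand-in for the encoder (RMS after average pooling at several window sizes), and nt_xent is the SimCLR-style normalized temperature-scaled cross-entropy loss, which may differ from the objective the authors use. The sketch relies only on the Python standard library.

```python
import math
import random


def augment(signal, rng, noise_std=0.05, scale_range=(0.8, 1.2)):
    """Return one random view: amplitude scaling plus Gaussian noise (assumed augmentations)."""
    scale = rng.uniform(scale_range[0], scale_range[1])
    return [scale * x + rng.gauss(0.0, noise_std) for x in signal]


def toy_encoder(signal, scales=(1, 4, 16)):
    """Hypothetical stand-in for MSF-SEA: RMS after average pooling at several window sizes."""
    features = []
    for s in scales:
        pooled = [sum(signal[i:i + s]) / s for i in range(0, len(signal) - s + 1, s)]
        features.append(math.sqrt(sum(p * p for p in pooled) / len(pooled)))
    return features


def cosine(u, v):
    """Cosine similarity with a small epsilon to avoid division by zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-12)


def nt_xent(z1, z2, temperature=0.5):
    """NT-Xent loss over N positive pairs (z1[i], z2[i]); other samples act as negatives."""
    z = z1 + z2                      # 2N embeddings
    n = len(z1)
    total = 0.0
    for i in range(2 * n):
        j = (i + n) % (2 * n)        # index of the positive partner of sample i
        logits = [cosine(z[i], z[k]) / temperature for k in range(2 * n) if k != i]
        positive = cosine(z[i], z[j]) / temperature
        m = max(logits)              # log-sum-exp trick for numerical stability
        log_sum_exp = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_sum_exp - positive
    return total / (2 * n)


def demo_loss(seed=0, batch=4, length=64):
    """Contrastive loss for two augmented views of a batch of synthetic sinusoids."""
    rng = random.Random(seed)
    signals = [
        [math.sin(2 * math.pi * (k + 1) * t / length) for t in range(length)]
        for k in range(batch)
    ]
    z1 = [toy_encoder(augment(s, rng)) for s in signals]
    z2 = [toy_encoder(augment(s, rng)) for s in signals]
    return nt_xent(z1, z2)
```

Calling demo_loss() returns a single scalar. In an actual pre-training loop that value would be minimized with respect to the encoder parameters, and a classifier head would then be fine-tuned on the few labeled samples. The toy encoder here has no trainable parameters, so the sketch shows only how the two views and the loss fit together.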

Published in: Measurement Science and Technology