Abstract Fault diagnosis transfer learning models commonly employ deep neural networks (DNN) to analyze time-frequency features. However, excessively deep neural networks can result in diminished generalization capabilities, leading to subpar performance of the model across various working conditions. Furthermore, inappropriate domain adaptation strategies significantly constrain the accuracy of the model. To address this issue, a Robustly Optimized Residual-Network and Vision Transformer Domain Adaptation Model (RoReViTDAM) is proposed in this article, combining Wavelet Packet Transform (WPT), residual networks, and self-attention mechanisms. Firstly, the WPT is utilized to construct Multi-band Wavelet Coefficient Matrix (MWCM) and corresponding Multi-band Wavelet Coefficient Time-Frequency Feature Matrix (MWSM) with small size and feature aggregation. Subsequently, a shallow Robustly Optimized Residual Network (RoResNet) is designed to effectively extract features from MWCM, considering the spatial distance dependencies of features. Additionally, ViT (Vision Transformer) is employed for time-frequency global feature extraction from MWSM. Furthermore, Domain Adversarial Neural Network (DANN) and Multi-Kernel Maximum Mean Discrepancy (MK-MMD) are employed to extract domain-invariant features from signals of different operating conditions and fault types. At last, three fault diagnosis experiments are conducted in multi-condition scenarios of bearings. The experimental results illustrate the superiority and effectiveness of the proposed model.
Read full abstract