Abstract

To solve the fault diagnosis difficulty of bearings with small data samples, a novel multi-information fusion vision transformer (ViT) model based on time–frequency representation (TFR) maps is proposed in this paper. The original vibration signal is decomposed into different scale sub-signals by the discrete wavelet transforms (DWTs), and the continuous wavelet transforms (CWTs) are used to transform these different scale sub-signals into time–frequency representation (TFR) maps, which are concatenated to input to the ViT model to diagnose the bearing fault. Through the multifaceted experiment analysis on the fault diagnosis of bearings with small data samples, the diagnosis results demonstrate that the proposed multi-information fusion ViT model can diagnose the fault of bearings with small data samples, with strong generalization and robustness; its average diagnosis accuracy achieved 99.85%, and it was superior to the other fault diagnosis methods, such as the multi-information fusion CNN, ViT model based on one-dimensional vibration signal, and ViT model based on the TFR of the original vibration signal.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call