An Efficient Rolling Bearing Fault Diagnosis Method Based on Spark and Improved Random Forest Algorithm

Lanjun Wan,Changyun Li,Xiaojun Deng,Kun Gong,Xinpan Yuan,Gen Zhang

doi:10.1109/access.2021.3063929

Lanjun Wan, Changyun Li + Show 4 more

Open Access

https://doi.org/10.1109/access.2021.3063929

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 107	License type: CC BY 4.0

Affiliation: Hunan University of Technology

Abstract

The random forest (RF) algorithm is a typical representative of ensemble learning, which is widely used in rolling bearing fault diagnosis. In order to solve the problems of slower diagnosis speed and repeated voting of traditional RF algorithm in rolling bearing fault diagnosis under the big data environment, an efficient rolling bearing fault diagnosis method based on Spark and improved random forest (IRF) algorithm is proposed. By eliminating the decision trees with low classification accuracy and those prone to repeated voting in the original RF, an improved RF with faster diagnosis speed and higher classification accuracy is constructed. For the massive rolling bearing vibration data, in order to improve the training speed and diagnosis speed of the rolling bearing fault diagnosis model, the IRF algorithm is parallelized on the Spark platform. First, an original RF model is obtained by training multiple decision trees in parallel. Second, the decision trees with low classification accuracy in the original RF model are filtered. Third, all path information of the reserved decision trees is obtained in parallel. Fourth, a decision tree similarity matrix is constructed in parallel to eliminate the decision trees which are prone to repeated voting. Finally, an IRF model which can diagnose rolling bearing faults quickly and effectively is obtained. A series of experiments are carried out to evaluate the effectiveness of the proposed rolling bearing fault diagnosis method based on Spark and IRF algorithm. The results show that the proposed method can not only achieve good fault diagnosis accuracy, but also have fast model training speed and fault diagnosis speed for large-scale rolling bearing datasets.

Highlights

Rolling bearings are the most critical and damaged components in rotating machinery, the availability, reliability, and productivity of rotating machinery depend on the health state of rolling bearings, the rolling bearing fault diagnosis is very vital to the stable, reliable, and efficient operation of rotating machinery [1], [2]
To solve the problems of slower diagnosis speed and repeated voting of traditional random forest (RF) algorithm in rolling bearing fault diagnosis under the big data environment, an efficient rolling bearing fault diagnosis method based on Spark and improved random forest (IRF) algorithm is proposed in this paper, which can significantly improve the speed of fault diagnosis, and improve the fault diagnosis accuracy to a certain extent
The sub-forest optimization significantly reduces the diagnosis time of rolling bearing fault diagnosis model based on IRF algorithm, and improves its diagnosis accuracy to a certain extent

Summary

Introduction

Rolling bearings are the most critical and damaged components in rotating machinery, the availability, reliability, and productivity of rotating machinery depend on the health state of rolling bearings, the rolling bearing fault diagnosis is very vital to the stable, reliable, and efficient operation of rotating machinery [1], [2]. With the rapid development of machine learning algorithms and deep learning algorithms, the datadriven fault diagnosis methods have been paid more and more attention. Li et al [3] used hierarchical symbol dynamic entropy and binary tree support vector machine for rolling bearing fault diagnosis. Zhou et al [4] proposed a rolling bearing fault diagnosis method based on K-Means clustering algorithm and k-nearest neighbor algorithm. Chen et al [5] combined permutation entropy of variational mode

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Efficient Rolling Bearing Fault Diagnosis Method Based on Spark and Improved Random Forest Algorithm

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Research on rolling bearing fault diagnosis based onTVFEMD-SVD and random forest algorithm
Cheng Yang
Journal of Physics: Conference Series | VOL. 2296
Cheng YangCheng Yang
01 Jun 2022
Journal of Physics: Conference Series | VOL. 2296

A Novel Rolling Bearing Fault Diagnosis Method Based on Adaptive Feature Selection and Clustering
Jingbao Hou ... Hai Gong
IEEE Access | VOL. 9
Jingbao Hou, et. al.Jingbao Hou ... Hai Gong
01 Jan 2020
IEEE Access | VOL. 9

Rolling Bearing Fault Diagnosis Method Based On LMD Entropy Feature Fusion
Bo Deng ... Jingchao Li
-
Bo Deng, et. al.Bo Deng ... Jingchao Li
15 Oct 2021
15 Oct 2021

Deep learning based on rolling bearing fault diagnosis method
Nianyun Liu ... Weiwei Song
Transactions on Engineering and Technology Research | VOL. 1
Nianyun Liu, et. al.Nianyun Liu ... Weiwei Song
11 Dec 2023
Transactions on Engineering and Technology Research | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Efficient Rolling Bearing Fault Diagnosis Method Based on Spark and Improved Random Forest Algorithm

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access