Abstract

Purpose Vehicle fault diagnosis is a key factor in ensuring the safe and efficient operation of the railway system. Due to the numerous vehicle categories and different fault mechanisms, there is an unbalanced fault category problem. Most of the current methods to solve this problem have complex algorithm structures, low efficiency and require prior knowledge. This study aims to propose a new method which has a simple structure and does not require any prior knowledge to achieve a fast diagnosis of unbalanced vehicle faults. Design/methodology/approach This study proposes a novel K-means with feature learning based on the feature learning K-means-improved cluster-centers selection (FKM-ICS) method, which includes the ICS and the FKM. Specifically, this study defines cluster centers approximation to select the initialized cluster centers in the ICS. This study uses improved term frequency-inverse document frequency to measure and adjust the feature word weights in each cluster, retaining the top τ feature words with the highest weight in each cluster and perform the clustering process again in the FKM. With the FKM-ICS method, clustering performance for unbalanced vehicle fault diagnosis can be significantly enhanced. Findings This study finds that the FKM-ICS can achieve a fast diagnosis of vehicle faults on the vehicle fault text (VFT) data set from a railway station in the 2017 (VFT) data set. The experimental results on VFT indicate the proposed method in this paper, outperforms several state-of-the-art methods. Originality/value This is the first effort to address the vehicle fault diagnostic problem and the proposed method performs effectively and efficiently. The ICS enables the FKM-ICS method to exclude the effect of outliers, solves the disadvantages of the fault text data contained a certain amount of noisy data, which effectively enhanced the method stability. The FKM enhances the distribution of feature words that discriminate between different fault categories and reduces the number of feature words to make the FKM-ICS method faster and better cluster for unbalanced vehicle fault diagnostic.

Highlights

  • With the rapid increase in the number of railway transport vehicles, inertial fault frequently occurs, which dramatically reduces the overall efficiency of vehicle operating (Yang et al, 2018)

  • This study finds that the feature learning K-means-improved cluster-centers selection (FKM-ICS) can achieve a fast diagnosis of vehicle faults on the vehicle fault text (VFT) data set from a railway station in the 2017 (VFT) data set

  • The FKM enhances the distribution of feature words that discriminate between different fault categories and reduces the number of feature words to make the FKM-ICS method faster and better cluster for unbalanced vehicle fault diagnostic

Read more

Summary

Introduction

With the rapid increase in the number of railway transport vehicles, inertial fault frequently occurs, which dramatically reduces the overall efficiency of vehicle operating (Yang et al, 2018). According to the research (Li et al, 2017), the main causes of vehicle faults are human, machine, environmental and management factors. The four factors are the major causes of vehicle faults and even passenger safety problems. This is an unbalanced fault category data that the vast majority of vehicle faults are caused by the same factors, while only a small number of faults are caused by the remaining factors. In the age of intelligent railways, the use of text mining and other machine learning algorithms to achieve intelligent diagnostic of unbalanced vehicle fault text (VFT) data is the current urgent technical method (Li et al, 2021)

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call