Due to the large span of flight delay, numerous influencing factors and large number of flights, the collected flight delay data is characterized by large-scale and imbalance, which is considered as one of the largest challenges in balanced flight delay data classification by using conventional classification algorithms. To solve this problem, an adaptive multi-class classification approach of imbalanced flight delay data using synthetic minority over-sampling technique (SMOTE)-based convolutional neural network with sparrow search algorithm, namely SSA-LTCNN is proposed in this paper. Firstly, the SMOTE is used to reduce the imbalance between the classes of the flight delay data. Then, the qualitative analysis method is employed to determine the key parameters of the CNN, which are optimized by using SSA to establish an optimized CNN model. Finally, the optimized CNN model with more effective feature representation ability is employed to extract discriminative features from the balanced flight delay data automatically. The obtained features in the last fully connected layer of the optimized CNN model are used for flight delay classification. The MIT-BIH Arrhythmia Data is selected to verify the effectiveness of the proposed SSA-LTCNN method and the confusion matrix, classification accuracy, precision, recall and F1 score are also used to evaluate the performance of the proposed SSA-LTCNN method, which can get better performance than RF, DT, LR, CNN and QRSCNN. The actual flight delay data is processed to verify the effectiveness of the proposed classification approach, which can achieve higher accuracy than KNN, NB, UCNN, CNN, SSA-LCNN and SSA-TCNN in different time periods. The experimental results show that the SSA-LTCNN can fast and effectively realize the classification of MIT-BIH Arrhythmia Data and flight delay data.
Read full abstract