One important aspect of human behavior understanding is the recognition and monitoring of daily activities. An accurate activity recognition system can improve the quality of life in many key areas. The multi-metric feature extraction and DeepForest classifier designed in this paper effectively had solved the problems of incomprehensive feature extraction and insufficient classifier accuracy.First, this paper extracts a total of 450 dimensional feature vectors using mean, variance, maximum, skewness, minimum, kurtosis, regression of independent variables and sample entropy as feature indicators, so that the features of the original vectors can be presented comprehensively. The dimensionality of the feature set is too large, so PCA is used to reduce the dimensionality of the high-dimensional feature vector. After studying the relationship between the dimensionality reduction and the cumulative contribution rate, it is concluded that the cumulative contribution rate reaches 90% when the dimensionality is 15, which can retain the original features better. Then DeepForest was used as the classifier, and the reduced-dimensional feature set was used as the sample set to divide the training set and test set by 3:1. The model was tested on the test set after training, and the classification accuracy of the test set was 98.202%. Six classifiers, GaussianNB, SVM, K-NN, XGBoost, RandomForest and DecisionTree, were selected as the control experimental group, of which only the RandomForest model reached 97%, while the rest of the control models did not achieve 95% effect, indicating that DeepForest was more accurate in classifying human activities.Next, the generalisation ability of the DeepForest classifier was assessed using Monte Carlo Cross Validation (MCCV), K-fold and its confusion matrix. The MCCV validation used 50% of the data as the training set and 30% of the data as the test set, and set the number of splits to 10, resulting in a mean accuracy of 98.227%. The size of K in the K-fold validation was determined to be 8 based on the number of people conducting the human activity experiment, and the final mean accuracy value obtained was 97.790%. The combined confusion matrix from the K-fold validation (which aggregates the confusion matrix for each classification result) was calculated, and the results showed that the highest accuracy reached 100% for A9 and A14 classifications, where A5 and A10, A15 and A10, A12 and A5 were more likely to be confused, with the highest error rate for A5 classification, which was 4.167%, and the rest of the activity classifications were better.
Read full abstract