Existing research on broiler sound classification and recognition extracts only a limited number of sound features, lacks in-depth analysis of which sound features are applicable, and does not study in depth the basis for selecting classification models or the process of optimizing them. To address these gaps, the author proposes a recognition method for broiler sound signals based on multi-domain sound features and classification models. The implementation is divided into a training stage and a testing stage. In the training stage, the experimental area is built, and multiple segments of broiler sound signals are collected and filtered. Sub-frame processing and endpoint detection yield the combinations of start and end frames of the multiple sound types in the broiler sound signals. Sixty sound features, drawn from four categories (time domain, frequency domain, Mel-Frequency Cepstral Coefficients (MFCC), and sparse representation), are extracted from each frame to form the feature vectors. These feature vectors are labeled manually to build the data set. The data set is processed with min-max standardization, and a random forest is used to calculate the importance of each sound feature. The thirty sound features that contribute most to classification performance are then retained. On this basis, classification models based on seven classification algorithms are trained; the best-performing model, which is based on k-Nearest Neighbor (kNN), is selected, and its inherent parameters are optimized to obtain the optimal classification model.
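The training-stage steps of min-max standardization, importance-based feature selection, and kNN classification can be illustrated with a minimal Python sketch. It is not the author's implementation: the function names, the toy three-feature data, the labels `type_a` and `type_b`, and the importance scores (which stand in for the output of a random forest) are all assumptions made for illustration.

```python
from collections import Counter
import math


def min_max_scale(rows):
    """Rescale each feature column to [0, 1]; constant columns map to 0."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    spans = [max(c) - min(c) for c in cols]
    return [
        [(v - lo) / sp if sp else 0.0 for v, lo, sp in zip(row, lows, spans)]
        for row in rows
    ]


def top_k_features(importances, k):
    """Indices of the k largest importance scores, in original column order."""
    ranked = sorted(range(len(importances)),
                    key=lambda i: importances[i], reverse=True)
    return sorted(ranked[:k])


def knn_predict(train_x, train_y, query, k=3):
    """Label a query vector by majority vote among its k nearest neighbours."""
    nearest = sorted(zip(train_x, train_y),
                     key=lambda pair: math.dist(pair[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


# Toy feature vectors (one row per frame) and hypothetical labels.
rows = [[0.0, 10.0, 5.0], [1.0, 20.0, 5.0], [4.0, 30.0, 5.0], [5.0, 40.0, 5.0]]
labels = ["type_a", "type_a", "type_b", "type_b"]

scaled = min_max_scale(rows)

# Assumed importance scores; in the paper these come from a random forest.
importances = [0.5, 0.4, 0.1]
keep = top_k_features(importances, k=2)
selected = [[row[i] for i in keep] for row in scaled]

# The query is assumed to be already scaled and reduced to the kept features.
prediction = knn_predict(selected, labels, [0.1, 0.1], k=3)
```

In this sketch the third feature is constant and has the lowest assumed importance, so it is dropped; the paper applies the same idea to reduce sixty features to thirty.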
The test results show that min-max standardization improves the average classification accuracy of the decision-tree-based classifier (abbreviated as DT classifier) by 0.6%, feature selection improves the average classification accuracy of the DT classifier by 3.1%, and parameter optimization improves the average classification accuracy of the kNN-based classification model by 1.2%, with a highest classification accuracy of 94.16%. In the testing stage, a segment of broiler sound signal collected in the broiler captivity area undergoes signal filtering, sub-frame processing, endpoint detection, and other steps to obtain the combinations of start and end frames of the multiple sound types it contains. Thirty sound features are extracted from each frame to form the data set to be predicted. The optimal classification model predicts a label for each piece of data in this data set. Majority voting over the predicted labels of the data corresponding to each sound type yields a common label, which is taken as the predicted type. On this basis, a definition of recognition accuracy for broiler sound signals is proposed. The test results show that the optimal classification model achieves a classification accuracy of 93.57% on the data set to be predicted and a recognition accuracy of 99.12% on the multiple segments of broiler sound signals.
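The majority-voting step can be sketched in a few lines of Python. The abstract does not state the proposed definition of recognition accuracy, so the sketch assumes it is the fraction of detected sound events whose voted label matches the reference type; that definition, the function names, and the toy labels are assumptions, not the author's formulation.

```python
from collections import Counter


def vote(frame_labels):
    """Return the most common frame-level label of one detected sound event.

    Ties are broken in favour of the label seen first.
    """
    return Counter(frame_labels).most_common(1)[0][0]


def recognition_accuracy(events, true_types):
    """Fraction of events whose voted label equals the reference type.

    This is an assumed definition used only for illustration.
    """
    voted = [vote(frames) for frames in events]
    hits = sum(v == t for v, t in zip(voted, true_types))
    return hits / len(true_types)


# Hypothetical frame-level predictions for three detected sound events.
events = [
    ["type_a", "type_a", "type_b", "type_a"],
    ["type_b", "type_b", "type_b"],
    ["type_a", "type_b", "type_a"],
]
true_types = ["type_a", "type_b", "type_a"]

# Frame-level accuracy: every frame is compared with its event's true type.
correct_frames = sum(
    label == truth
    for frames, truth in zip(events, true_types)
    for label in frames
)
total_frames = sum(len(frames) for frames in events)
frame_accuracy = correct_frames / total_frames

event_accuracy = recognition_accuracy(events, true_types)
```

Here two of the ten frames are misclassified, yet every event receives the correct voted label. Under the assumed definition, this shows how an event-level recognition accuracy can exceed the frame-level classification accuracy, in the same direction as the 99.12% and 93.57% figures reported above.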