Machine learning based on genetic algorithms is an important application. The multi-dimensional ordered sample clustering problem is often solved using Fisher’s optimal segmentation method. However, this method has obvious shortcomings when encountering long sample problems due to its high storage requirements during the computation process. Therefore, Fisher’s optimal two-segmentation method is generally used in practical problems instead, which avoids storage problems. But it is prone to local optima. Based on the analysis of the shortcomings of the Fisher optimal segmentation and optimal two-segmentation algorithms, this paper proposes a genetic-based machine learning clustering algorithm, which overcomes the problem of Fisher’s optimal two-segmentation algorithm being prone to local optima and also solves the problem of high storage requirements during the computation process of Fisher’s optimal segmentation method. The application of this algorithm in the optimization system of water environment monitoring points shows that it is effective.
Read full abstract