Cancer, as a shocking disease, is one of the most common malignant tumors among women, posing a huge threat to the physical health and safety of women worldwide. With the continuous development of science and technology, more and more high and new technologies are involved in the diagnosis and prediction of breast cancer. In recent years, intelligent medical assistants supported by data mining and machine learning algorithms have provided necessary support for doctors’ diagnosis. This study proposes an improved LightGBM hybrid integration model. Introducing gradient harmonic loss and cross entropy loss to enhance the model’s attention to minority classes in the dataset and alleviate the impact of data imbalance on diagnostic results. Designing whale optimization algorithm to improve LightGBM to achieve iterative optimization of hyperparameters, and enhance the overall performance of the model. Proposing Jacobian regularization method to denoise LightGBM to solve the problem of model sensitivity to noise. Developing the LightGBM hybrid integration model to ensure the accuracy and stability of model diagnosis on diverse and imbalanced datasets. The effectiveness of the proposed method has been comprehensively compared and verified through the dataset in the UCI machine learning repository, and the results show that the proposed method has achieved good diagnostic performance in all indicators. The hybrid integration model proposed in this paper can provide effective auxiliary support for doctors to diagnose breast cancer.
Read full abstract