Breast cancer (BC) infection, which is peculiar to women, brings about the high rate of deaths among women in every part of the world. The early investigation of BC has minimized the severe effects of cancer as compared to the last stage diagnosis. Doctors for diagnostic tests usually suggest the medical imaging modalities like mammograms or biopsy histopathology (Hp) images. However, Hp image analysis gives doctors more confidence to diagnose BC as compared to mammograms. Many studies used Hp images to develop BC classification models to assist doctors in early BC diagnosis. However, these models lack better and reliable results in terms of reporting multiple performance evaluation metrics. Therefore, the goal of this study is to create a reliable, more accurate model that consumes minimum resources by using transfer learning based convolution neural network model. The proposed model uses the trained model after fine tuning, hence requires less number of images and can show better results on minimum resources. BreakHis dataset, which is available publicly has been employed in overall experiments in this research. BreakHis dataset is separated into training, testing, and validation for the experimentation. In addition, the dataset for training was augmented followed by stain normalization. By using the concept of transfer learning (TL), AleNext was retained after fine-tuning the last layer for binary classification like benign and malignant. Afterward, preprocessed images are fed into the TL based model for training. The model training was performed many times by changing the hyper-parameters randomly until the minimum validation loss was achieved. Now the trained model was used for feature extraction. The extracted features were further evaluated by using six ML classifiers (i.e. softmax, Decision tree, Naïve Bayes, Linear discriminant analysis, Support vector machine, k-nearest neighbor) through five performance measures such as precision, F-measure, accuracy, specificity, and sensitivity for experimental evaluation. The softmax has outperformed among all classifiers. Furthermore, to reduce the wrong prediction, a misclassification reducing (MR) algorithm was developed. After using the MR algorithm the proposed model produced better and reliable results. The observed accuracy, specificity, sensitivity, precision and F measure are 81.25%, 77.47%, 82.49%, 91.70%, and 86.80% respectively. These results show that the proposed TL based model along with misclassification reduction algorithm produced comparable results to the current baseline models. Hence, the expected model could serve as a second opinion for BC classification in any healthcare center.
Read full abstract