THE EFFECT OF DATASETS ON BREAST CANCER DETECTION MODELS

Kumawuese Jennifer Kurugh,Awwal Ahmad Babajo,Muhammad Aminu Ahmad

doi:10.33003/fjs-2020-0404-487

Kumawuese Jennifer Kurugh, Awwal Ahmad Babajo + Show 1 more

Open Access

https://doi.org/10.33003/fjs-2020-0404-487

Copy DOI

Journal: FUDMA JOURNAL OF SCIENCES	Publication Date: Jun 13, 2021
License type: CC BY 4.0

Abstract

Datasets are a major requirement in the development of breast cancer classification/detection models using machine learning algorithms. These models can provide an effective, accurate and less expensive diagnosis method and reduce life losses. However, using the same machine learning algorithms on different datasets yields different results. This research developed several machine learning models for breast cancer classification/detection using Random forest, support vector machine, K Nearest Neighbors, Gaussian Naïve Bayes, Perceptron and Logistic regression. Three widely used test data sets were used; Wisconsin Breast Cancer (WBC) Original, Wisconsin Diagnostic Breast Cancer (WDBC) and Wisconsin Prognostic Breast Cancer (WPBC). The results show that datasets affect the performance of machine learning classifiers. Also, the machine learning classifiers have different performances with a given breast cancer dataset

Full Text