Improving Performance of Machine Learning on Prediction of Breast Cancer Over a Small Sample Dataset

Neetu Sangari,Yue Qu

doi:10.1007/978-3-030-71704-9_70

Abstract

The application of machine learning (ML) algorithms aim to develop prognostic tools that could be trained on data that is routinely collected. In a typical scenario, the ML algorithm-based prognostic tool is utilized to search through large volumes of data to look for complex relationships in the training data. However, not much attention has been devoted to scenarios where small sample datasets are a widespread occurrence in research areas involving human participants such as clinical trials, genetics, and neuroimaging. In this research, we have studied the impact of the size of the sample dataset on the model performance of different ML algorithms. We compare the model fitting and model prediction performance on the original small dataset and the augmented dataset. Our research has discovered that the model fitted on a small dataset exhibits severe overfitting during the testing stage, which reduces when the model is trained on the augmented dataset. However, to different ML algorithms, the improvement of the model performance due to trained by the augmented dataset may vary.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving Performance of Machine Learning on Prediction of Breast Cancer Over a Small Sample Dataset

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Investigating Machine Learning as a Basis for Asteroid Taxnomies in the 3-Micron Spectral Region
Matthew Richardson ... Amanda Sickafoose
-
Matthew Richardson, et. al.Matthew Richardson ... Amanda Sickafoose
08 Oct 2020
08 Oct 2020

Construction of a small sample dataset and identification of Pitaya trees (Selenicereus) based on UAV image on close-range acquisition
Qianxia Li ... Lihui Yan
Journal of Applied Remote Sensing | VOL. 16
Qianxia Li, et. al.Qianxia Li ... Lihui Yan
06 Apr 2022
Journal of Applied Remote Sensing | VOL. 16

Plants meet machines: Prospects in machine learning for plant biology
Pamela S Soltis ... Alina Zare
Applications in Plant Sciences | VOL. 8
Pamela S Soltis, et. al.Pamela S Soltis ... Alina Zare
01 Jun 2020
Applications in Plant Sciences | VOL. 8

Application of machine learning in predicting survival outcomes involving real-world data: a scoping review
Yinan Huang ... Rajender R Aparasu
BMC Medical Research Methodology | VOL. 23
Yinan Huang, et. al.Yinan Huang ... Rajender R Aparasu
13 Nov 2023
BMC Medical Research Methodology | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving Performance of Machine Learning on Prediction of Breast Cancer Over a Small Sample Dataset

Abstract

Talk to us

Similar Papers