Deep learning‐based research on the influence of training data size for breast cancer pathology detection

Chongyang Cui,Xiaolei Qu,Han Lei,Shangchun Fan,Dezhi Zheng

doi:10.1049/joe.2018.9093

Chongyang Cui, Xiaolei Qu + Show 3 more

Open Access

https://doi.org/10.1049/joe.2018.9093

Copy DOI

Journal: The Journal of Engineering	Publication Date: Dec 1, 2019
Citations: 4	License type: CC BY 3.0

Affiliation: Beihang University

Abstract

In pathological diagnosis of breast cancer, there are problems such as shortage of pathologists, difficulties in sample labeling, and huge workload of manual diagnosis. Therefore, deep learning-based computer-assisted pathology analysis systems have been developed to diagnose breast cancer and have achieved impressive results. However, it is difficult to obtain a large number of training sets due to the scarcity of pathological images and the huge labeling costs. Therefore, the size of the training set should be planned before building the pathology computer-assisted breast cancer analysis system. Here, the authors present a study to determine the optimal size of the training data set needed to achieve high classification accuracy when developing a pathology computer-assisted breast cancer analysis system. The authors trained two kind of CNNs using six different sizes of training data set and then tested the resulting system with a total of 10,000 images. All images were acquired from the Camelyon17 challenge. Here, the authors propose a scheme for determining the size of the training set and the size of the model in developing the pathology computer-assisted breast cancer analysis systems, which can be easily applied to develop systems for other different pathological images.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep learning‐based research on the influence of training data size for breast cancer pathology detection

Abstract

Talk to us

Similar Papers

More From: The Journal of Engineering

Lead the way for us

Similar Papers

Automated classification of fauna in seabed photographs: The impact of training and validation dataset size, with considerations for the class imbalance
Jennifer M Durden ... Henry A Ruhl
Progress in Oceanography | VOL. 196
Jennifer M Durden, et. al.Jennifer M Durden ... Henry A Ruhl
20 May 2021
Progress in Oceanography | VOL. 196

A Novel Hybrid CNN-AIS Visual Pattern Recognition Engine
Vandna Bhalla ... Arihant Jain
-
Vandna Bhalla, et. al.Vandna Bhalla ... Arihant Jain
01 Jan 2015
01 Jan 2015

Weighted Gaussian Process Regression for Single Image Super-resolution Based on Randomized Sample Clustering and Augmentation
Chao Guo ... Mingbo Yang
-
Chao Guo, et. al.Chao Guo ... Mingbo Yang
28 Jun 2021
28 Jun 2021

Comparison of data driven modeling approaches for temperature prediction in data centers
Jayati Athavale ... Yogendra Joshi
International Journal of Heat and Mass Transfer | VOL. 135
Jayati Athavale, et. al.Jayati Athavale ... Yogendra Joshi
21 Feb 2019
International Journal of Heat and Mass Transfer | VOL. 135

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep learning‐based research on the influence of training data size for breast cancer pathology detection

Abstract

Talk to us

Similar Papers

More From: The Journal of Engineering