Natural vs balanced distribution in deep learning on whole slide images for cancer detection

Ismat Ara Reshma,Sylvain Cussat-Blanc,Hervé Luga,Radu Tudor Ionescu,Josiane Mothe

doi:10.1145/3412841.3441884

Ismat Ara Reshma, Sylvain Cussat-Blanc + Show 3 more

Open Access

https://doi.org/10.1145/3412841.3441884

Copy DOI

Publication Date: Mar 22, 2021
Citations: 2	License type: other-oa

Affiliation: Université de Toulouse, University of Bucharest

Abstract

The class distribution of data is one of the factors that regulates the performance of machine learning models. However, investigations on the impact of different distributions available in the literature are very few, sometimes absent for domain-specific tasks. In this paper, we analyze the impact of natural and balanced distributions of the training set in deep learning (DL) models applied on histological images, also known as whole slide images (WSIs). WSIs are considered as the gold standard for cancer diagnosis. In recent years, researchers have turned their attention to DL models to automate and accelerate the diagnosis process. In the training of such DL models, filtering out the non-regions-of-interest from the WSIs and adopting an artificial distribution---usually a balanced distribution---is a common trend. In our analysis, we show that keeping the WSIs data in their usual distribution---which we call natural distribution---for DL training produces fewer false positives (FPs) with comparable false negatives (FNs) than the artificially-obtained balanced distribution. We conduct an empirical comparative study with 10 random folds for each distribution, comparing the resulting average performance levels in terms of five different evaluation metrics. Experimental results show the effectiveness of the natural distribution over the balanced one across all the evaluation metrics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Natural vs balanced distribution in deep learning on whole slide images for cancer detection

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

P995 Development and validation of a deep learning model based on histological characteristics to predict primary non-response of infliximab in Crohn's disease
Z Liu ... R Mao
Journal of Crohn's and Colitis | VOL. 18
Z Liu, et. al.Z Liu ... R Mao
24 Jan 2024
Journal of Crohn's and Colitis | VOL. 18

Abstract 2584: Decoding colon cancer recurrence: Unveiling accurate predictions with attention-guided deep neural networks on histopathological whole slide images
Mohammad K Alexanderani ... Jorge Moscat
Cancer Research | VOL. 84
Mohammad K Alexanderani, et. al.Mohammad K Alexanderani ... Jorge Moscat
22 Mar 2024
Cancer Research | VOL. 84

Deep learning to predict subtypes of poorly differentiated lung cancer from biopsy whole slide images.
Gouji Toyokawa ... Kengo Tateishi
Journal of Clinical Oncology | VOL. 39
Gouji Toyokawa, et. al.Gouji Toyokawa ... Kengo Tateishi
20 May 2021
Journal of Clinical Oncology | VOL. 39

KMIT-Pathology: Digital Pathology AI Platform for Cancer Biomarkers Identification on Whole Slide Images
Rajasekaran Subramanian ... R Devika Rubi
International Journal of Advanced Computer Science and Applications | VOL. 13
Rajasekaran Subramanian, et. al.Rajasekaran Subramanian ... R Devika Rubi
01 Jan 2021
International Journal of Advanced Computer Science and Applications | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Natural vs balanced distribution in deep learning on whole slide images for cancer detection

Abstract

Talk to us

Similar Papers