Hidden Variables in Deep Learning Digital Pathology and Their Potential to Cause Batch Effects: Prediction Model Study.

Max Schmitt,Achim Hekler,Roman Christoph Maron,Axel Hauschild,Albrecht Stenzinger,Jakob Nikolas Kather,Dieter Krahl,Michael Weichenthal,Jochen Sven Utikal,Heinz Kutzner,Titus Josef Brinker,Frederick Klauschen,Stefan Fröhling,Christof Von Kalle,Markus Tiemann,Sebastian Haferkamp,Eva Krieghoff-Henning

doi:10.2196/23436

Abstract

BackgroundAn increasing number of studies within digital pathology show the potential of artificial intelligence (AI) to diagnose cancer using histological whole slide images, which requires large and diverse data sets. While diversification may result in more generalizable AI-based systems, it can also introduce hidden variables. If neural networks are able to distinguish/learn hidden variables, these variables can introduce batch effects that compromise the accuracy of classification systems.ObjectiveThe objective of the study was to analyze the learnability of an exemplary selection of hidden variables (patient age, slide preparation date, slide origin, and scanner type) that are commonly found in whole slide image data sets in digital pathology and could create batch effects.MethodsWe trained four separate convolutional neural networks (CNNs) to learn four variables using a data set of digitized whole slide melanoma images from five different institutes. For robustness, each CNN training and evaluation run was repeated multiple times, and a variable was only considered learnable if the lower bound of the 95% confidence interval of its mean balanced accuracy was above 50.0%.ResultsA mean balanced accuracy above 50.0% was achieved for all four tasks, even when considering the lower bound of the 95% confidence interval. Performance between tasks showed wide variation, ranging from 56.1% (slide preparation date) to 100% (slide origin).ConclusionsBecause all of the analyzed hidden variables are learnable, they have the potential to create batch effects in dermatopathology data sets, which negatively affect AI-based classification systems. Practitioners should be aware of these and similar pitfalls when developing and evaluating such systems and address these and potentially other batch effect variables in their data sets through sufficient data set stratification.

Highlights

The advent of artificial intelligence (AI) in digital pathology (DP) has resulted in the development of various algorithms for the detection, classification, and further evaluation of multiple cancer subtypes [1]
Each convolutional neural network (CNN) training and evaluation run was repeated multiple times, and a variable was only considered learnable if the lower bound of the 95% confidence interval of its mean balanced accuracy was above 50.0%
The successful implementation of CNN-based assistance systems in DP is complicated by a plethora of challenges [8,9,10], some of which are domain-specific, while others are omnipresent in the field of deep learning (DL) and machine learning (ML) in general

Summary

Introduction

The advent of artificial intelligence (AI) in digital pathology (DP) has resulted in the development of various algorithms for the detection, classification, and further evaluation of multiple cancer subtypes [1]. In DP, such artifacts are introduced during tissue processing and slide preparation [13], and presumably during slide digitization, image compression, and storage [14], all of which affect slide and image appearance (Figure 1). We expand on this definition of batch effects by including biological factors, presumably unrelated to the actual classification task, as causative agents. Both factors (biological and nonbiological) are referred to as hidden variables from here on. If neural networks are able to distinguish/learn hidden variables, these variables can introduce batch effects that compromise the accuracy of classification systems

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Medical Internet Research	Publication Date: Feb 2, 2021
Citations: 43	License type: cc-by

R Discovery Prime

R Discovery Prime

Hidden Variables in Deep Learning Digital Pathology and Their Potential to Cause Batch Effects: Prediction Model Study.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Medical Internet Research

Lead the way for us

Similar Papers

Decision letter: Development and evaluation of a live birth prediction model for evaluating human blastocysts from a retrospective study
Larisa V Suturina ... Ricardo Azziz
-
Larisa V Suturina, et. al.Larisa V Suturina ... Ricardo Azziz
12 Dec 2022
12 Dec 2022

Author response: Development and evaluation of a live birth prediction model for evaluating human blastocysts from a retrospective study
Yifan Gu ... Ge Lin
-
Yifan Gu, et. al.Yifan Gu ... Ge Lin
12 Jan 2023
12 Jan 2023

Editor's evaluation: Development and evaluation of a live birth prediction model for evaluating human blastocysts from a retrospective study
Larisa V Suturina
-
Larisa V SuturinaLarisa V Suturina
12 Dec 2022
12 Dec 2022

Comprehensive Study for Breast Cancer Using Deep Learning and Traditional Machine Learning
-
ZANCO JOURNAL OF PURE AND APPLIED SCIENCES | VOL. 34
--
12 Apr 2022
ZANCO JOURNAL OF PURE AND APPLIED SCIENCES | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hidden Variables in Deep Learning Digital Pathology and Their Potential to Cause Batch Effects: Prediction Model Study.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Medical Internet Research