A Pathologist-Annotated Dataset for Validating Artificial Intelligence: A Project Description and Pilot Study

Sarah Dudgeon ,Manasi Sheth,Joel Saltz,Ashish Sharma,Brandon D Gallas,Matthew G Hanna,Darick Tong,Weijie Chen,Mohamed Amgad,Anant Madabhushi,Bruce Werness,Clifford H Szu,Markus D Herrmann,Denis Larsimont,Si Wen,Evan Szu,Rajarsi Gupta,Evangelos Hytopoulos,Steven N Hart,Roberto Salgado,Vikram Singh ,Richard S.p Huang ,Hetal D Marble

doi:10.4103/jpi.jpi_83_20

Abstract

Purpose: Validating artificial intelligence algorithms for clinical use in medical images is a challenging endeavor due to a lack of standard reference data (ground truth). This topic typically occupies a small portion of the discussion in research papers since most of the efforts are focused on developing novel algorithms. In this work, we present a collaboration to create a validation dataset of pathologist annotations for algorithms that process whole slide images. We focus on data collection and evaluation of algorithm performance in the context of estimating the density of stromal tumor-infiltrating lymphocytes (sTILs) in breast cancer. Methods: We digitized 64 glass slides of hematoxylin- and eosin-stained invasive ductal carcinoma core biopsies prepared at a single clinical site. A collaborating pathologist selected 10 regions of interest (ROIs) per slide for evaluation. We created training materials and workflows to crowdsource pathologist image annotations on two modes: an optical microscope and two digital platforms. The microscope platform allows the same ROIs to be evaluated in both modes. The workflows collect the ROI type, a decision on whether the ROI is appropriate for estimating the density of sTILs, and if appropriate, the sTIL density value for that ROI. Results: In total, 19 pathologists made 1645 ROI evaluations during a data collection event and the following 2 weeks. The pilot study yielded an abundant number of cases with nominal sTIL infiltration. Furthermore, we found that the sTIL densities are correlated within a case, and there is notable pathologist variability. Consequently, we outline plans to improve our ROI and case sampling methods. We also outline statistical methods to account for ROI correlations within a case and pathologist variability when validating an algorithm. Conclusion: We have built workflows for efficient data collection and tested them in a pilot study. As we prepare for pivotal studies, we will investigate methods to use the dataset as an external validation tool for algorithms. We will also consider what it will take for the dataset to be fit for a regulatory purpose: study size, patient population, and pathologist training and qualifications. To this end, we will elicit feedback from the Food and Drug Administration via the Medical Device Development Tool program and from the broader digital pathology and AI community. Ultimately, we intend to share the dataset, statistical methods, and lessons learned.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Pathology Informatics	Publication Date: Jan 1, 2021
Citations: 20	License type: cc-by-nc-sa

R Discovery Prime

R Discovery Prime

A Pathologist-Annotated Dataset for Validating Artificial Intelligence: A Project Description and Pilot Study

Abstract

Talk to us

Similar Papers

More From: Journal of Pathology Informatics

Lead the way for us

Similar Papers

Abstract 460: Tools for collecting pathologist annotations and understanding interobserver variability
Katherine N Elfer ... Bruce Werness
Cancer Research | VOL. 82
Katherine N Elfer, et. al.Katherine N Elfer ... Bruce Werness
15 Jun 2022
Cancer Research | VOL. 82

Artificial intelligence-powered whole-slide image analyzer reveals a distinctive distribution of tumor-infiltrating lymphocytes in neuroendocrine tumors and carcinomas.
Hyung-Gyo Cho ... Jeongun Ryu
Journal of Clinical Oncology | VOL. 40
Hyung-Gyo Cho, et. al.Hyung-Gyo Cho ... Jeongun Ryu
01 Jun 2022
Journal of Clinical Oncology | VOL. 40

Diffusion kurtosis imaging in evaluating gliomas: different region of interest selection methods on time efficiency, measurement repeatability, and diagnostic ability.
Jian-Ping Chu ... Yi-Su Tian
European Radiology | VOL. 31
Jian-Ping Chu, et. al.Jian-Ping Chu ... Yi-Su Tian
28 Aug 2020
European Radiology | VOL. 31

Automated Pipeline for Brain ROI Analysis with Results Comparable to Previous Freehand Measures in Clinical Settings
Tero Ilvesmäki ... Ullamari Hakulinen
-
Tero Ilvesmäki, et. al.Tero Ilvesmäki ... Ullamari Hakulinen
13 Jun 2017
13 Jun 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Pathologist-Annotated Dataset for Validating Artificial Intelligence: A Project Description and Pilot Study

Abstract

Talk to us

Similar Papers

More From: Journal of Pathology Informatics