Abstract

Deep-learning classification systems have the potential to improve cancer diagnosis. However, development of these computational approaches so far depends on prior pathological annotations and large training datasets. The manual annotation is low-resolution, time-consuming, highly variable and subject to observer variance. To address this issue, we developed a method, H&E Molecular neural network (HEMnet). HEMnet utilizes immunohistochemistry as an initial molecular label for cancer cells on a H&E image and trains a cancer classifier on the overlapping clinical histopathological images. Using this molecular transfer method, HEMnet successfully generated and labeled 21,939 tumor and 8782 normal tiles from ten whole-slide images for model training. After building the model, HEMnet accurately identified colorectal cancer regions, which achieved 0.84 and 0.73 of ROC AUC values compared to p53 staining and pathological annotations, respectively. Our validation study using histopathology images from TCGA samples accurately estimated tumor purity, which showed a significant correlation (regression coefficient of 0.8) with the estimation based on genomic sequencing data. Thus, HEMnet contributes to addressing two main challenges in cancer deep-learning analysis, namely the need to have a large number of images for training and the dependence on manual labeling by a pathologist. HEMnet also predicts cancer cells at a much higher resolution compared to manual histopathologic evaluation. Overall, our method provides a path towards a fully automated delineation of any type of tumor so long as there is a cancer-oriented molecular stain available for subsequent learning. Software, tutorials and interactive tools are available at:https://github.com/BiomedicalMachineLearning/HEMnet

Highlights

  • Histopathological examination of tissue is indispensable for the accurate diagnosis and treatment of cancer[1–3]

  • We aim to address three key challenges, namely the dependence on the variable pathologist annotation for model training, the need to have a large number of images for training, and the demand to achieve a high-resolution and quantitative prediction of cancer cells

  • Histopathological examination of hematoxylin and eosin (H&E) images has been the gold standard for pathologic diagnosis of almost all suspected cancer patients[3,28]

Read more

Summary

BACKGROUND

Histopathological examination of tissue is indispensable for the accurate diagnosis and treatment of cancer[1–3]. A key challenge for deep learning is the need for a very large number of accurately labeled data[11] For this approach, many methods require WSIs that are manually annotated by a pathologist[12]. Mutations result in the stabilization and subsequently accumula- We used these image tiles to train a deep-transfer-learning tion of p53 in malignant cells[21], allowing it to be readily detected classifier to identify cancer regions in clinical H&E images using by IHC. With thousands of labeled tiles, a convolutional neural network classifier was trained based on an in-house colorectal cancer dataset With this training and testing approach, we achieved a high performance on an independent set of histopathologic sections and images. We labeled the H&E image based on the p53 staining pattern where p53-positive regions are labeled

RESULTS
A Su et al 3
DISCUSSION
Findings
CODE AVAILABILITY
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call