Abstract

Globally, liver cancer causes more than 700,000 deaths each year and is the second-leading cause of death from cancer. Hepatocellular carcinoma (HCC) is the most common type of liver cancer in adults and accounts for most deaths in cirrhosis patients. Patients with early-stage liver cancer can be treated by surgical intervention with a good prognosis; thus, early diagnosis, as confirmed by liver pathology examination, is necessary to combat HCC. Conventional manual pathology examination requires considerable time and labor, even with established expertise. It is widely accepted that intelligent classifiers may prove effective in the diagnosis process. In this study, we used a GoogLeNet (Inception-V1)-based binary classifier to classify HCC histopathology images. The classifier achieved 91.37% (±2.49) accuracy, 92.16% (±4.93) sensitivity, and 90.57% (±2.54) specificity in HCC classification. Although the classification accuracy of deep learning is reported to be positively correlated with the amount of training data, it is often uncertain how much training data are required for deep learning to achieve satisfactory performance in clinical diagnosis. Moreover, deep learning methods require annotated data to generate efficient models. However, annotated data are a relatively scarce resource and can be expensive to obtain. Hence, the relationship between classification accuracy and the number of liver histopathology images for training was investigated. An inverse power law function-based estimation model is proposed to evaluate the minimum number of annotated training images required for a desired diagnostic accuracy.

Highlights

  • Liver cancer causes more than 700,000 deaths each year [1], is the sixth-most frequent cancer (6%), and is the second-leading cause of death from cancer (9%) [2]

  • An outline of our study is as follows: (i) we adopted a GoogLeNet (Inception-V1)-based deep learning architecture to classify Hepatocellular carcinoma (HCC) and Normal histopathology images. (ii) We determined the impact of diversity of HCC histopathology images on testing accuracy for new images when using single image training. (iii) We investigated and evaluated the relationship between model accuracies and the training dataset size

  • The results show that the GoogLeNet (Inception-V1) has the potential to better classify histopathology images compared to other deep learning models

Read more

Summary

INTRODUCTION

Liver cancer causes more than 700,000 deaths each year [1], is the sixth-most frequent cancer (6%), and is the second-leading cause of death from cancer (9%) [2]. Uyumazturk et al recently developed a DenseNet-121based classifier to classify the subtypes of liver tumors (hepatocellular carcinoma and cholangiocarcinoma (CC)) For their model development, a total of 70 WSIs (35 HCC and 35 CC) were randomly selected from the Cancer Genome Atlas (TCGA). Current studies on deep learning-based HCC histopathology image classification focus on investigating the test results obtained using a specific size of training data. In other words, these studies used a specific number of WSIs for model training, following which the testing accuracy was obtained. An outline of our study is as follows: (i) we adopted a GoogLeNet (Inception-V1)-based deep learning architecture to classify HCC and Normal histopathology images. The classification accuracies are tested with different sizes of training datasets. (iv) We used an inverse power law function-based fitting curve to evaluate the minimum number of annotated training images required to achieve a desired diagnostic accuracy

METHODS AND MATERIALS
EVALUATION OF 25-IMAGE TRAINING MODEL
Findings
CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.