Abstract

Background: Histopathology images of tumor biopsies present unique challenges for applying machine learning to the diagnosis and treatment of cancer. The pathology slides are high resolution, often exceeding 1 GB, have non-uniform dimensions, and often contain multiple tissue slices of varying sizes surrounded by large empty regions. The locations of abnormal or cancerous cells, which may constitute a small portion of any given tissue sample, are not annotated. Cancer image datasets are also extremely imbalanced, with most slides being associated with relatively common cancers. Since deep representations trained on natural photographs are unlikely to be optimal for classifying pathology slide images, which have different spectral ranges and spatial structure, we here describe an approach for learning features and inferring representations of cancer pathology slides based on sparse coding.

Results: We show that conventional transfer learning using a state-of-the-art deep learning architecture pre-trained on ImageNet (RESNET) and fine-tuned for a binary tumor/no-tumor classification task achieved between 85% and 86% accuracy. However, when all layers up to the last convolutional layer in RESNET are replaced with a single feature map inferred via sparse coding, using a dictionary optimized for sparse reconstruction of unlabeled pathology slides, classification performance improves to over 93%, corresponding to a 54% error reduction.

Conclusions: We conclude that a feature dictionary optimized for biomedical imagery may in general support better classification performance than does conventional transfer learning using a dictionary pre-trained on natural images.
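The reported 54% error reduction follows directly from the two accuracies. A minimal sketch of the arithmetic, assuming the 86% figure as the transfer-learning baseline (the helper function name is ours, for illustration only):

```python
def relative_error_reduction(baseline_acc, new_acc):
    """Fraction of the baseline error rate eliminated by the new model."""
    baseline_err = 1.0 - baseline_acc
    new_err = 1.0 - new_acc
    return (baseline_err - new_err) / baseline_err

# 86% transfer-learning baseline vs. ~93.5% sparse-coding accuracy:
# error falls from 14% to 6.5%, roughly a 54% relative reduction.
reduction = relative_error_reduction(0.86, 0.935)
print(round(reduction, 2))  # prints 0.54
```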

Highlights

  • Images of tumor biopsies have a long history in oncology, and remain an important component of cancer diagnosis and treatment; they provide promising opportunities for the application of machine learning to human health

  • Although high performance can be achieved using features trained from conventional photographic databases, i.e., “transfer learning,” it remains unclear whether such features are truly optimal for the specialized task of tumor discrimination from cancer pathology slides, for which the low-level image statistics are likely to be very different (Fischer et al., BMC Bioinformatics 2018, 19(Suppl 18):489)

  • Histological examination of tumor biopsies is a task currently performed by highly trained human pathologists, who assess the type and grade of tumors based on the appearance of thin tissue slices, typically stained with eosin and hematoxylin, in an optical microscope


Introduction

Images of tumor biopsies have a long history in oncology, and remain an important component of cancer diagnosis and treatment; they provide promising opportunities for the application of machine learning to human health. Automated feature discovery has become increasingly common, and some have argued that “general purpose” image feature dictionaries (trained on ImageNet, for instance) may achieve high performance on specialized classification tasks [5,6,7]. Conventional deep learning approaches are problematic here due to the large, non-uniform image sizes, the limited number of training examples, the imbalanced nature of the image data, and the occasional need for labeling (e.g., annotations that distinguish normal from cancerous tissue within an image); much of the substantial body of work in this area has been focused on segmentation within an image [10] or limited to a small number of tumor types [7, 11,12,13,14]. Since deep representations trained on natural photographs are unlikely to be optimal for classifying pathology slide images, which have different spectral ranges and spatial structure, we here describe an approach for learning features and inferring representations of cancer pathology slides based on sparse coding.

