Abstract BACKGROUND Histologic examination is vital in oncology research and diagnostics. The adoption of digital scanning of whole slide images (WSI) has created an opportunity to leverage deep learning-based image classification methods to enhance diagnosis and risk stratification. However, technical limitations prevent training and deployment of accurate comprehensive multiclass deep convolutional neural networks (DCNN) models for histopathology image classification. The input dimensions of DCNN architectures are small compared to the typical pathologist field of view, degrading performance by excluding important architectural features. Furthermore, data requirements for comprehensive models are sufficiently large to overwhelm the system memory during training. METHODS A method termed Learned Resizing with Efficient Training (LRET) was developed to address the main limitations of traditional histopathology classification model training. The LRET method couples efficient training techniques with image resizing to facilitate seamless integration of larger histology image patches into state-of-the-art classification models while preserving important structural information. The LRET method was coupled with two distinct resizing techniques to train three diverse histology image datasets using five different DCNN architectures. Performance metrics were compared on cross validation and hold out test sets. RESULTS LRET-trained models were flexible to multiple input patch dimensions and DCNN models. We demonstrated performance improvement across all datasets while significantly reducing the training time and resources over traditional methods. Using a large-scale, multiclass brain tumor classification dataset consisting of 74 distinct histopathologic classes, LRET-trained models outperformed existing methods by 15-28% in accuracy, yielding 94% accuracy for the best model. CONCLUSION The LRET method for DCNN training significantly enhances the performance of large-scale multiclass histopathology image classification. The implications of this work extend to broader applications within medical imaging and beyond, where efficient integration of high-resolution images into deep learning pipelines is paramount for driving advancements research and clinical practice.
Read full abstract