The diagnostic classification of digitized tissue images based on histopathologic lesions present in whole slide images (WSI) is a significant task that eludes modern image classification techniques. Even with advanced methods designed for digital histopathology, the domain of toxicologic pathology presents challenges in that histopathologic features may be at times complex, subtle, and/or rare. We propose an innovative weakly supervised learning method that leverages minimal annotations, a state-of-the-art self-supervised vision transformer for embedding extraction, and a novel guided attention mechanism that is better suited for heavily imbalanced datasets typical in toxicologic pathology. Our model demonstrates improvements in diagnostic classification and attention heatmap quality over the previously described clustering-constrained-attention multiple-instance learning method on several lesion classes in rat livers (38% improvement in AUC). We also demonstrate how an ensemble of binary classifiers improves interpretability and allows for multiclass classification and the classification of diagnostic regions of interest in each slide. The improved classification performance and higher contrast heatmaps better support toxicologic pathologists' histopathology analysis and will enable more efficient workflows as they are further refined and integrated into routine use.
Read full abstract