Classification Backbone Research Articles

This paper addresses few-shot semantic segmentation and proposes a novel transductive end-to-end method that overcomes three key problems affecting performance. First, we present a novel ensemble of visual features learned from pretrained classification and semantic segmentation networks with the same architecture. Our approach leverages the varying discriminative power of these networks, resulting in rich and diverse visual features that are more informative than a pretrained classification backbone that is not optimized for dense pixel-wise classification tasks used in most state-of-the-art methods. Secondly, the pretrained semantic segmentation network serves as a base class extractor, which effectively mitigates false positives that occur during inference time and are caused by base objects other than the object of interest. Thirdly, a two-step segmentation approach using transductive meta-learning is presented to address the episodes with poor similarity between the support and query images. The proposed transductive meta-learning method addresses the prediction by first learning the relationship between labeled and unlabeled data points with matching support foreground to query features (intra-class similarity) and then applying this knowledge to predict on the unlabeled query image (intra-object similarity), which simultaneously learns propagation and false positive suppression. To evaluate our method, we performed experiments on benchmark datasets, and the results demonstrate significant improvement with minimal trainable parameters of 2.98M. Specifically, using Resnet-101, we achieve state-of-the-art performance for both 1-shot and 5-shot Pascal-5i\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$5^{i}$$\\end{document}, as well as for 1-shot and 5-shot COCO-20i\\documentclass[12pt]{minimal} \\usepackage{amsmath} \\usepackage{wasysym} \\usepackage{amsfonts} \\usepackage{amssymb} \\usepackage{amsbsy} \\usepackage{mathrsfs} \\usepackage{upgreek} \\setlength{\\oddsidemargin}{-69pt} \\begin{document}$$20^{i}$$\\end{document}.

Read full abstract

The analysis of bone marrow smears (BMS) serves as a critical diagnostic tool in hematology, offering valuable insights into the cellular composition and distinctive morphology of bone marrow cells. It enables the accurate diagnosis, classification, and monitoring of various hematologic diseases. Currently, both clinical routine diagnostics and scientific research, especially for the implementation of digital technologies, rely heavily on the manual selection of regions of interest (ROI) suitable for cell-level evaluation. Current guidelines recommend the manual evaluation of up to 500 cells per sample making the process time-consuming, cost-ineffective, subjective, and prone to human error. Consequently, significant inter-observer variability and inconsistencies in diagnosis often arise. Automated analysis of BMS using whole slide images (WSI) offers a promising approach to alleviate financial and time-related burdens while promoting standardization and streamlining of routine diagnostics. We have developed a machine learning approach utilizing a fully convolutional neural network (CNN) specifically designed for semantic segmentation based on the algorithm by Long et al. (IEEE Computer Society, 2015). The model is capable of learning deep semantic representations in its classification backbone combined with shape features in a shallow layer to produce detailed segmentations. The classification backbone consists of an adapted residual neural network with 50 layers (Resnet50) by He et al. (arXiv:1512.03385, 2015). To train the model, we curated a diverse dataset consisting of 60 different WSI BMS, each manually and independently annotated by at least two experts providing a robust ground truth (figure 1). Data augmentation was performed via horizontal flipping, random resizing, and random cropping. A train-test split of 80:20 was implemented with 5-fold internal cross-validation. The model's accuracy was evaluated as mean intersection-over-union ratio yielding high congruency between ground truth and model prediction (figure 2). The final model uses downscaled images effectively reducing data storage size and data transfer time decreasing fully automated ROI prediction time to &lt;300 ms for one WSI. The model allows for in-depth examination at the cellular level, harnessing the potential of thousands of relevant cells present in a WSI, in contrast to the limited range of 200 to 500 cells typically examined in conventional clinical routine diagnostics. Thereby, the model solves an essential bottleneck in computer vision in microscopy minimizing human manual labor and standardizing BMS processing results.

Read full abstract

Classification Backbone Research Articles

Related Topics

Articles published on Classification Backbone

CEDNet: A cascade encoder–decoder network for dense prediction

Multi-View Fusion Network-Based Gesture Recognition Using sEMG Data.

Weakly supervised learning for multi-class medical image segmentation via feature decomposition

Transductive meta-learning with enhanced feature ensemble for few-shot semantic segmentation

Transfer learning for galaxy feature detection: Finding giant star-forming clumps in low-redshift galaxies using Faster Region-based Convolutional Neural Network

Ships Detection on Aerial Imagery using Transfer Learning and Selective Search

Deep learning diagnostic performance and visual insights in differentiating benign and malignant thyroid nodules on ultrasound images.

Automated Preselection of Regions of Interest By Deep Learning Facilitates Rapid Whole Slide Image Analysis of Bone Marrow Smears

Linear listing order and hierarchical classification: history, conflict, and use

Predicting the efficacy of non-steroidal anti-inflammatory drugs in migraine using deep learning and three-dimensional T1-weighted images

Weakly supervised semantic segmentation via self-supervised destruction learning

Online intervention siamese tracking

UIU-Net: U-Net in U-Net for Infrared Small Object Detection.

Three-Dimensional Face Recognition Using Solid Harmonic Wavelet Scattering and Homotopy Dictionary Learning.

PDBL: Improving Histopathological Tissue Classification With Plug-and-Play Pyramidal Deep-Broad Learning.

A Computer Vision Model to Identify the Incorrect Use of Face Masks for COVID-19 Awareness

The Bangkok Urbanscapes Dataset for Semantic Urban Scene Understanding Using Enhanced Encoder-Decoder With Atrous Depthwise Separable A1 Convolutional Neural Networks

Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification

Video action detection by learning graph-based spatio-temporal interactions

Local Enhancement and Bidirectional Feature Refinement Network for Single-Shot Detector

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Classification Backbone Research Articles

Related Topics

Articles published on Classification Backbone

CEDNet: A cascade encoder–decoder network for dense prediction

Multi-View Fusion Network-Based Gesture Recognition Using sEMG Data.

Weakly supervised learning for multi-class medical image segmentation via feature decomposition

Transductive meta-learning with enhanced feature ensemble for few-shot semantic segmentation

Transfer learning for galaxy feature detection: Finding giant star-forming clumps in low-redshift galaxies using Faster Region-based Convolutional Neural Network

Ships Detection on Aerial Imagery using Transfer Learning and Selective Search

Deep learning diagnostic performance and visual insights in differentiating benign and malignant thyroid nodules on ultrasound images.

Automated Preselection of Regions of Interest By Deep Learning Facilitates Rapid Whole Slide Image Analysis of Bone Marrow Smears

Linear listing order and hierarchical classification: history, conflict, and use

Predicting the efficacy of non-steroidal anti-inflammatory drugs in migraine using deep learning and three-dimensional T1-weighted images

Weakly supervised semantic segmentation via self-supervised destruction learning

Online intervention siamese tracking

UIU-Net: U-Net in U-Net for Infrared Small Object Detection.

Three-Dimensional Face Recognition Using Solid Harmonic Wavelet Scattering and Homotopy Dictionary Learning.

PDBL: Improving Histopathological Tissue Classification With Plug-and-Play Pyramidal Deep-Broad Learning.

A Computer Vision Model to Identify the Incorrect Use of Face Masks for COVID-19 Awareness

The Bangkok Urbanscapes Dataset for Semantic Urban Scene Understanding Using Enhanced Encoder-Decoder With Atrous Depthwise Separable A1 Convolutional Neural Networks

Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification

Video action detection by learning graph-based spatio-temporal interactions

Local Enhancement and Bidirectional Feature Refinement Network for Single-Shot Detector