Background and Objective
Histopathology is the gold standard for the diagnosis of many cancers. Recent advances in computer vision, specifically deep learning, have facilitated the analysis of histopathology images for many tasks, including the detection of immune cells and microsatellite instability. However, it remains difficult to identify optimal models and training configurations for different histopathology classification tasks due to the abundance of available architectures and the lack of systematic evaluations. Our objective in this work is to present a software tool that addresses this need and enables robust, systematic evaluation of neural network models for patch classification in histology in a lightweight, easy-to-use package for both algorithm developers and biomedical researchers.

Methods
Here we present ChampKit (Comprehensive Histopathology Assessment of Model Predictions toolKit): an extensible, fully reproducible evaluation toolkit that serves as a one-stop shop to train and evaluate deep neural networks for patch classification. ChampKit curates a broad range of public datasets. It enables training and evaluation of models supported by timm directly from the command line, without the need for users to write any code; external models are supported through a straightforward API and minimal coding (see the sketch below). As a result, ChampKit facilitates the evaluation of existing and new models and deep learning architectures on pathology datasets, making them more accessible to the broader scientific community. To demonstrate the utility of ChampKit, we establish baseline performance for a subset of the models that could be employed with ChampKit, focusing on three popular deep learning architectures: ResNet18, ResNet50, and R26-ViT, a hybrid vision transformer. In addition, we compare each model trained either from random weight initialization or with transfer learning from ImageNet-pretrained models. For ResNet18, we also consider transfer learning from a self-supervised pretrained model.

Results
The main result of this paper is the ChampKit software. Using ChampKit, we systematically evaluated multiple neural networks across six datasets. We observed mixed results when evaluating the benefits of pretraining versus random initialization, with no clear benefit except in the low-data regime, where transfer learning was found to be beneficial. Surprisingly, we found that transfer learning from self-supervised weights rarely improved performance, which runs counter to findings in other areas of computer vision.

Conclusions
Choosing the right model for a given digital pathology dataset is nontrivial. ChampKit fills this gap by enabling the evaluation of hundreds of existing (or user-defined) deep learning models across a variety of pathology tasks. Source code and data for the tool are freely accessible at https://github.com/SBU-BMI/champkit.
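As an illustration of the external-model API mentioned in the Methods, the sketch below shows one way a user-defined network could be exposed to a timm-driven toolkit such as ChampKit. It is a minimal sketch under the assumption that registering a model in timm's registry is sufficient for the toolkit to discover it; the model name `my_tiny_cnn` and its architecture are hypothetical and not part of ChampKit or timm.

```python
# Minimal sketch: exposing a custom model through timm's registry so a
# timm-based toolkit such as ChampKit could create it by name.
# `my_tiny_cnn` is a hypothetical toy example, not a real timm model.
import timm
import torch.nn as nn
from timm.models import register_model


@register_model
def my_tiny_cnn(pretrained: bool = False, num_classes: int = 2, **kwargs) -> nn.Module:
    """Toy binary patch classifier for RGB histology patches."""
    return nn.Sequential(
        nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1),
        nn.ReLU(inplace=True),
        nn.AdaptiveAvgPool2d(1),
        nn.Flatten(),
        nn.Linear(16, num_classes),
    )


# After registration, the model is created by name like any built-in timm model.
model = timm.create_model("my_tiny_cnn", num_classes=2)
```

Because `timm.create_model` resolves models by registered name, any network exposed this way can be trained and evaluated with the same command-line workflow as timm's built-in architectures.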