Two-step Learning Approach Research Articles

Predictive modeling is useful but very challenging in biological image analysis due to the high cost of obtaining and labeling training data. For example, in the study of gene interaction and regulation in Drosophila embryogenesis, the analysis is most biologically meaningful when in situ hybridization (ISH) gene expression pattern images from the same developmental stage are compared. However, labeling training data with precise stages is very time-consuming even for developmental biologists. Thus, a critical challenge is how to build accurate computational models for precise developmental stage classification from limited training samples. In addition, identification and visualization of developmental landmarks are required to enable biologists to interpret prediction results and calibrate models. To address these challenges, we propose a deep two-step low-shot learning framework to accurately classify ISH images using limited training images. Specifically, to enable accurate model training on limited training samples, we formulate the task as a deep low-shot learning problem and develop a novel two-step learning approach, including data-level learning and feature-level learning. We use a deep residual network as our base model and achieve improved performance in the precise stage prediction task of ISH images. Furthermore, the deep model can be interpreted by computing saliency maps, which consists of pixel-wise contributions of an image to its prediction result. In our task, saliency maps are used to assist the identification and visualization of developmental landmarks. Our experimental results show that the proposed model can not only make accurate predictions but also yield biologically meaningful interpretations. We anticipate our methods to be easily generalizable to other biological image classification tasks with small training datasets. Our open-source code is available at https://github.com/divelab/lsl-fly.

Read full abstract

As a common problem in classification tasks, class imbalance degrades the performance of the classifier. Catastrophic out-of-pocket (OOP) health expenditure is a specific example of a rare event faced by very few households. The objective of the present study is to demonstrate a two-step learning approach for modeling highly unbalanced catastrophic OOP health expenditure data. The data are retrieved from the nationally representative Household Budget Survey collected in 2012 by the Turkish Statistical Institute. In total, 9987 households returned valid survey responses. The predictive models are based on eight common risk factors of catastrophic OOP health expenditure. The minority class in the training dataset is oversampled by using a synthetic minority oversampling technique (SMOTE) function, and the original and balanced oversampled training datasets are used to establish the classification models. Logistic regression (LR), random forest (RF) (100 trees), support vector machine (SVM), and neural network (NN) are determined as classifiers. The weighted percentage of households faced with catastrophic OOP health expenditure is 0.14. Balanced oversampling increases the area under the receiver operating characteristic (ROC) curve of LR, RF, SVM, and NN by 0.08%, 0.62%, 0.20%, and 0.23%, respectively. The ROC curve shows NN and RF to be the best classifiers for a balanced oversampled dataset. Identifying a classifier to model highly imbalanced catastrophic OOP health expenditure requires the two-stage procedure of (i) considering a balance between classes and (ii) comparing alternative classifiers. NN and RF are good classifiers in a prediction task with imbalanced catastrophic OOP health expenditure data.

Read full abstract

Two-step Learning Approach Research Articles

Related Topics

Articles published on Two-step Learning Approach

Knowledge-Informed Sparse Learning for Relevant Feature Selection and Optimal Quality Prediction

Deep Low-Shot Learning for Biological Image Classification and Visualization From Limited Training Samples.

Predicting individual quality ratings of compressed images through deep CNNs-based artificial observers

The impact of oversampling with “ubSMOTE” on the performance of machine learning classifiers in prediction of catastrophic health expenditures

Fast approximation of variational Bayes Dirichlet process mixture using the maximization–maximization algorithm

Nonlinear System Identification Using Quasi-ARX RBFN Models with a Parameter-Classified Scheme

TrendLearner: Early prediction of popularity trends of user generated content

Quantum inspired PSO for the optimization of simultaneous recurrent neural networks as MIMO learning systems

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Two-step Learning Approach Research Articles

Related Topics

Articles published on Two-step Learning Approach

Knowledge-Informed Sparse Learning for Relevant Feature Selection and Optimal Quality Prediction

Deep Low-Shot Learning for Biological Image Classification and Visualization From Limited Training Samples.

Predicting individual quality ratings of compressed images through deep CNNs-based artificial observers

The impact of oversampling with “ubSMOTE” on the performance of machine learning classifiers in prediction of catastrophic health expenditures

Fast approximation of variational Bayes Dirichlet process mixture using the maximization–maximization algorithm

Nonlinear System Identification Using Quasi-ARX RBFN Models with a Parameter-Classified Scheme

TrendLearner: Early prediction of popularity trends of user generated content

Quantum inspired PSO for the optimization of simultaneous recurrent neural networks as MIMO learning systems