Number Of Annotated Examples Research Articles

With the advancement in pose estimation techniques, human posture detection recently received considerable attention in many applications, including ergonomics and healthcare. When using neural network models, overfitting and poor performance are prevalent issues. Recently, convolutional neural networks (CNNs) were successfully used for human posture recognition from human images due to their superior multiscale high-level visual representations over hand-engineering low-level characteristics. However, calculating millions of parameters in a deep CNN requires a significant number of annotated examples, which prohibits many deep CNNs such as AlexNet and VGG16 from being used on issues with minimal training data. We propose a new three-phase model for decision support that integrates CNN transfer learning, image data augmentation, and hyperparameter optimization (HPO) to address this problem. The model is used as part of a new decision support framework for the optimization of hyperparameters for AlexNet, VGG16, CNN, and multilayer perceptron (MLP) models for accomplishing optimal classification results. The AlexNet and VGG16 transfer learning algorithms with HPO are used for human posture detection, while CNN and Multilayer Perceptron (MLP) were used as standard classifiers for contrast. The HPO methods are essential for machine learning and deep learning algorithms because they directly influence the behaviors of training algorithms and have a major impact on the performance of machine learning and deep learning models. We used an image data augmentation technique to increase the number of images to be used for model training to reduce model overfitting and improve classification performance using the AlexNet, VGG16, CNN, and MLP models. The optimal combination of hyperparameters was found for the four models using a random-based search strategy. The MPII human posture datasets were used to test the proposed approach. The proposed models achieved an accuracy of 91.2% using AlexNet, 90.2% using VGG16, 87.5% using CNN, and 89.9% using MLP. The study is the first HPO study executed on the MPII human pose dataset.

Read full abstract

Bone age assessment (BAA) has various clinical applications such as diagnosis of endocrine disorders and prediction of final adult height for adolescents. Recent studies indicate that deep learning techniques have great potential in developing automated BAA methods with significant advantages over the conventional methods based on handcrafted features. In this paper, we propose a multi-scale data fusion framework for bone age assessment with X-ray images based on non-subsampled contourlet transform (NSCT) and convolutional neural networks (CNNs). Unlike the existing CNN-based BAA methods that adopt the original spatial domain image as network input directly, we pre-extract a rich set of features for the input image by performing NSCT to obtain its multi-scale and multi-direction representations. This feature pre-extraction strategy could be beneficial to network training as the number of annotated examples in the problem of BAA is typically quite limited. The obtained NSCT coefficient maps at each scale are fed into a convolutional network individually and the information from different scales are then merged to achieve the final prediction. Specifically, two CNN models with different data fusion strategies are presented for BAA: a regression model with feature-level fusion and a classification model with decision-level fusion. Experiments on the public BAA dataset Digital Hand Atlas demonstrate that the proposed method can obtain promising results and outperform many state-of-the-art BAA methods. In particular, the proposed approaches exhibit obvious advantages over the corresponding spatial domain approaches (generally with an improvement of more than 0.1 years on the mean absolute error), showing great potential in the future study of this field.

Read full abstract

Number Of Annotated Examples Research Articles

Articles published on Number Of Annotated Examples

Active Gaze Labeling: Visualization for Trust Building.

Human Posture Detection Using Image Augmentation and Hyperparameter-Optimized Transfer Learning Algorithms

Synthetic document generator for annotation-free layout recognition

Natural language processing to facilitate breast cancer research and management.

A multi-scale data fusion framework for bone age assessment with convolutional neural networks

Acquiring Word-Meaning Mappings for Natural Language Interfaces

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Number Of Annotated Examples Research Articles

Articles published on Number Of Annotated Examples

Active Gaze Labeling: Visualization for Trust Building.

Human Posture Detection Using Image Augmentation and Hyperparameter-Optimized Transfer Learning Algorithms

Synthetic document generator for annotation-free layout recognition

Natural language processing to facilitate breast cancer research and management.

A multi-scale data fusion framework for bone age assessment with convolutional neural networks

Acquiring Word-Meaning Mappings for Natural Language Interfaces