Abstract
An artificial neural network (ANN) is an automatic way of capturing linear and nonlinear correlations, as well as spatial and other structural dependence, among features. Such networks perform well in many application areas, including classification and prediction from magnetic resonance imaging, spatial data, and computer vision tasks. Most commonly used ANNs assume that the training data are large relative to the dimension of the feature vector. In modern applications such as those above, however, the training sample size is often small and may even be smaller than the feature dimension. In this paper, we consider a single-layer ANN classification model suitable for analyzing high-dimensional low-sample-size (HDLSS) data. We investigate the theoretical properties of the sparse group lasso regularized neural network and show that, under mild conditions, its classification risk converges to the risk of the optimal Bayes classifier (universal consistency). Moreover, we propose a variation on the regularization term. Examples from popular research fields are provided to illustrate the theory and methods.
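For reference, the sparse group lasso penalty on the input-to-hidden weight matrix $W \in \mathbb{R}^{r \times p}$ typically takes the following form (our notation, not necessarily the paper's):

$$\Omega(W) = \lambda_1 \sum_{j=1}^{p}\sum_{k=1}^{r} |W_{kj}| \;+\; \lambda_2 \sum_{j=1}^{p} \Big(\sum_{k=1}^{r} W_{kj}^2\Big)^{1/2},$$

where column $j$ of $W$ groups all connections from input feature $j$ to the $r$ hidden nodes: the group ($\ell_2$) term removes entire features, while the $\ell_1$ term zeroes individual connections among the retained features.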
Highlights
High-dimensional models with correlated predictors are commonly seen in practice
Neural networks, which have been applied in practice for years, perform well with correlated predictors
The first example revisits the simulation study in [17], where we show numerically that the sparse group lasso neural network (SGLNN) performs comparably to Deep Neural Pursuit (DNP) in their setup
Summary
High-dimensional models with correlated predictors are common in practice. Most statistical models work well either in the low-dimensional correlated case or in the high-dimensional independent case. Few methods handle high-dimensional correlated predictors, and those that do typically have limited theoretical and practical capacity. The lasso part of the penalty further shrinks some weights of the selected input features to zero: a selected feature need not be connected to all nodes in the hidden layer. This penalization encourages as many zero weights as possible; a minimal sketch of the resulting training objective follows this summary. Existing results include the universal approximation capability of single-layer neural networks and estimation and classification consistency under the Gaussian assumption and 0-1 loss in the low-dimensional case. This theory assumes the 0-1 loss, which is rarely used in practice today, and does not cover the high-dimensional setting considered here.
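The sketch below is our own illustration (not the paper's code) of the mechanism described above: a single-hidden-layer classifier trained with the sparse group lasso penalty added directly to the loss. The paper may instead use a proximal or other specialized optimizer; model size, learning rate, and the tuning parameters lam1 and lam2 are placeholder choices.

```python
# Minimal sketch, assuming PyTorch: single-hidden-layer classifier with a
# sparse group lasso penalty on the input-to-hidden weights.
import torch
import torch.nn as nn

class SGLNet(nn.Module):
    def __init__(self, p, hidden=32, classes=2):
        super().__init__()
        # weight shape is (hidden, p); column j collects all connections of feature j
        self.hidden = nn.Linear(p, hidden)
        self.out = nn.Linear(hidden, classes)

    def forward(self, x):
        return self.out(torch.relu(self.hidden(x)))

def sgl_penalty(W, lam1, lam2):
    """Sparse group lasso: l1 on every entry plus the l2 norm of each feature's column."""
    return lam1 * W.abs().sum() + lam2 * W.norm(dim=0).sum()

def train(X, y, lam1=1e-3, lam2=1e-2, epochs=200, lr=1e-2):
    # X is an (n, p) float tensor, y an (n,) long tensor of class labels (assumed given).
    model = SGLNet(X.shape[1])
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss = loss_fn(model(X), y) + sgl_penalty(model.hidden.weight, lam1, lam2)
        loss.backward()
        opt.step()
    return model
```

After training, columns of model.hidden.weight whose norm is (numerically) zero correspond to screened-out features, while the l1 term leaves the surviving features connected to only a subset of hidden nodes, matching the behavior described in the summary.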