Layer Neural Network Research Articles

The receptive field (RF) plays a crucial role in convolutional neural networks (CNNs) because it determines the amount of input information that each neuron in a CNN can perceive, which directly affects the feature extraction ability. As the number of convolutional layers in CNNs increases, there is a decay of the RF according to the two-dimensional Gaussian distribution. Thus, an effective receptive field (ERF) can be used to characterize the available part of the RF. The ERF is calculated by the kernel size and layer number within the neural network architecture. Currently, ERF calculation methods are typically applied to single-channel input data that are both independent and identically distributed. However, such methods may result in a loss of effective information if they are applied to more general (i.e., multi-channel) datasets. Therefore, we proposed a multi-channel ERF calculation method. By conducting a series of numerical experiments, we determined the relationship between the ERF and the convolutional kernel size in conjunction with the layer number. To validate the new method, we used the recently published global wave surrogate model for climate simulation (GWSM4C) and its accompanying dataset. According to the newly established relationship, we refined the kernel size and layer number in each neural network of the GWSM4C to produce the same ERF but lower RF attenuation rates than those of the original version. By visualizing the gradient map at several points in West African and East Pacific areas, the high gradient value regions confirmed the known swell sources, which indicated effective feature extraction in these areas. Furthermore, the new version of the GWSM4C yielded better prediction accuracy for significant wave height in global swell pools. The root mean square errors in the West African and East Pacific regions reduced from approximately 0.3 m, in the original model to about 0.15 m, in the new model. Moreover, these improvements were attributed to the higher efficiency of the newly modified neural network structure that allows the inclusion of more historical winds while maintaining acceptable computational consumption.

Early screening for diabetes can promptly identify potential early stage patients, possibly delaying complications and reducing mortality rates. This paper presents a novel technique for early diabetes screening and prediction, called the Attention-Enhanced Deep Neural Network (AEDNN). The proposed AEDNN model incorporates an Attention-based Feature Weighting Layer combined with deep neural network layers to achieve precise diabetes prediction. In this study, we utilized the Diabetes-NHANES dataset and the Pima Indians Diabetes dataset. To handle significant missing values and outliers, group median imputation was applied. Oversampling techniques were used to balance the diabetes and non-diabetes groups. The data were processed through an Attention-based Feature Weighting Layer for feature extraction, producing a feature matrix. This matrix was subjected to Hadamard product operations with the raw data to obtain weighted data, which were subsequently input into deep neural network layers for training. The parameters were fine-tuned and the L2 regularization and dropout layers were added to enhance the generalization performance of the model. The model’s reliability was thoroughly assessed through various metrics, including the accuracy, precision, recall, F1 score, mean squared error (MSE), and R2 score, as well as the ROC and AUC curves. The proposed model achieved a prediction accuracy of 98.4% in the Pima Indians Diabetes dataset. When the test dataset was expanded to the large-scale Diabetes-NHANES dataset, which contains 52,390 samples, the test precision of the model improved further to 99.82%, with an AUC of 0.9995. A comparative analysis was conducted using multiple models, including logistic regression with L1 regularization, support vector machine (SVM), random forest, K-nearest neighbors (KNNs), AdaBoost, XGBoost, and the latest semi-supervised XGBoost. The feature extraction method using attention mechanisms was compared with the classical feature selection methods, Lasso and Ridge. The experiments were performed on the same dataset, and the conclusion was that the Attention-based Ensemble Deep Neural Network (AEDNN) outperformed all the aforementioned methods. These results indicate that the model not only performs well on smaller datasets but also fully leverages its advantages on larger datasets, demonstrating strong generalization ability and robustness. The proposed model can effectively assist clinicians in the early screening of diabetes patients. This is particularly beneficial for the preliminary screening of high-risk individuals in large-scale, extensive healthcare datasets, followed by detailed examination and diagnosis. Compared to the existing methods, our AEDNN model showed an overall performance improvement of 1.75%.

Layer Neural Network Research Articles

Related Topics

Articles published on Layer Neural Network

A new fusion neural network model and credit card fraud identification.

Numerical investigation of the effective receptive field and its relationship with convolutional kernels and layers in convolutional neural network

Utilizing Attention-Enhanced Deep Neural Networks for Large-Scale Preliminary Diabetes Screening in Population Health Data

Predicting Bacterial Antibiotic Resistance using MALDI-TOF Mass Spectrometry Databases with ELM Applications.

Deep-time neural networks: An efficient approach for solving high-dimensional PDEs

Investigating 10 Yr of Volcanoacoustic Activity at Tungurahua Volcano, Ecuador, Aided by Machine Learning

Distance preserving machine learning for uncertainty aware accelerator capacitance predictions

AWDP-FL: An Adaptive Differential Privacy Federated Learning Framework

The topology and geometry of neural representations

Brain-body-task co-adaptation can improve autonomous learning and speed of bipedal walking.

Trainable signal encoders that are robust against noise

Validation of large language models for detecting pathologic complete response in breast cancer using population-based pathology reports

A 1000FPS@360,000pixels mixed-signal sensing with computing macro featuring analog compression and maximum parallelism for objective detection tasks

Industrial robot energy consumption model identification: A coupling model-driven and data-driven paradigm

Improved JPEG Lossless Compression for Compression of Intermediate Layers in Neural Networks Based on Compute-In-Memory

Optimal layer selection for latent data augmentation

Evaluating the Impact of Convolutional Neural Network Layer Depth on the Enhancement of Inertial Navigation System Solutions

Creep Lifetime Prediction of Alloy 617 using Black Box Machine Learning Approach

Q-Deformed and delta-parametrized A-generalized logistic function induced Banach space valued multivariate multi layer neural network approximations

Development and validation of machine learning models for predicting cancer-related fatigue in lymphoma survivors

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Layer Neural Network Research Articles

Related Topics

Articles published on Layer Neural Network

A new fusion neural network model and credit card fraud identification.

Numerical investigation of the effective receptive field and its relationship with convolutional kernels and layers in convolutional neural network

Utilizing Attention-Enhanced Deep Neural Networks for Large-Scale Preliminary Diabetes Screening in Population Health Data

Predicting Bacterial Antibiotic Resistance using MALDI-TOF Mass Spectrometry Databases with ELM Applications.

Deep-time neural networks: An efficient approach for solving high-dimensional PDEs

Investigating 10 Yr of Volcanoacoustic Activity at Tungurahua Volcano, Ecuador, Aided by Machine Learning

Distance preserving machine learning for uncertainty aware accelerator capacitance predictions

AWDP-FL: An Adaptive Differential Privacy Federated Learning Framework

The topology and geometry of neural representations

Brain-body-task co-adaptation can improve autonomous learning and speed of bipedal walking.

Trainable signal encoders that are robust against noise

Validation of large language models for detecting pathologic complete response in breast cancer using population-based pathology reports

A 1000FPS@360,000pixels mixed-signal sensing with computing macro featuring analog compression and maximum parallelism for objective detection tasks

Industrial robot energy consumption model identification: A coupling model-driven and data-driven paradigm

Improved JPEG Lossless Compression for Compression of Intermediate Layers in Neural Networks Based on Compute-In-Memory

Optimal layer selection for latent data augmentation

Evaluating the Impact of Convolutional Neural Network Layer Depth on the Enhancement of Inertial Navigation System Solutions

Creep Lifetime Prediction of Alloy 617 using Black Box Machine Learning Approach

Q-Deformed and delta-parametrized A-generalized logistic function induced Banach space valued multivariate multi layer neural network approximations

Development and validation of machine learning models for predicting cancer-related fatigue in lymphoma survivors