Nonlinear Kernel Research Articles

Background: Machine learning (ML) prediction models in healthcare and pharmacy-related research face challenges in encoding high-dimensional Healthcare Coding Systems (HCSs) such as ICD, ATC, and DRG codes, given the trade-off between reducing model dimensionality and minimizing information loss.
Objectives: To investigate the use of network analysis modularity as a method for grouping HCSs to improve encoding in ML models.
Methods: The MIMIC-III dataset was used to create a multimorbidity network in which ICD-9 codes are the nodes and each edge is weighted by the number of patients sharing the same ICD-9 code pair. A modularity detection algorithm was applied at different resolution thresholds to generate 6 sets of modules. The impact of four grouping strategies on the performance of predicting 90-day Intensive Care Unit readmissions was assessed. The strategies compared were: 1) binary encoding of codes, 2) encoding codes grouped by network modules, 3) grouping codes to the highest level of the ICD-9 hierarchy, and 4) grouping using the single-level Clinical Classification Software (CCS). The same methodology was also applied to encode DRG codes, but the comparison was limited to a single modularity threshold versus binary encoding. Performance was assessed using Logistic Regression, Support Vector Machine with a non-linear kernel, and Gradient Boosting Machine algorithms. Accuracy, precision, recall, AUC, and F1-score with 95% confidence intervals were reported.
Results: Models that used modularity encoding outperformed models that used binary encoding of ungrouped codes. Accuracy improved across all algorithms, ranging from 0.736 to 0.78 for modularity encoding compared with 0.727 to 0.779 for binary encoding. AUC, recall, and precision also improved across almost all algorithms. In comparison with the other grouping approaches, modularity encoding generally showed slightly higher AUC, ranging from 0.813 to 0.837, and slightly higher precision, ranging from 0.752 to 0.782.
Conclusions: Modularity encoding enhances the performance of ML models in pharmacy research by effectively reducing dimensionality while retaining the necessary information. Across the three algorithms used, models utilizing modularity encoding showed superior or comparable performance to other encoding approaches. Modularity encoding offers further advantages: it can be used for both hierarchical and non-hierarchical HCSs, it is clinically relevant, and it can enhance the clinical interpretation of ML models. A Python package has been developed to facilitate the use of the approach in future research.
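
The network construction and module-based encoding described in the Methods can be illustrated with a minimal, standard-library-only sketch. It is not the authors' package: the patient records, the `partition` grouping, and all function names are hypothetical, and instead of running a modularity detection algorithm it only scores a given grouping with the usual weighted modularity, Q = Σ_c [L_c/m − γ (d_c/2m)²], where γ is assumed to play the role of the resolution threshold.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy records (patient id -> ICD-9 codes); not taken from MIMIC-III.
patients = {
    "p1": {"4280", "4019", "25000"},
    "p2": {"4280", "4019"},
    "p3": {"25000", "5849"},
    "p4": {"5849", "25000", "4019"},
    "p5": {"4280", "4019"},
}


def build_edges(patient_codes):
    """Edge weight = number of patients whose record contains both codes."""
    edges = Counter()
    for codes in patient_codes.values():
        for a, b in combinations(sorted(codes), 2):
            edges[(a, b)] += 1
    return edges


def modularity(edges, partition, resolution=1.0):
    """Weighted modularity of a given partition (code -> module id)."""
    m = sum(edges.values())   # total edge weight
    intra = Counter()         # edge weight inside each module
    degree = Counter()        # summed weighted degree of each module
    for (a, b), w in edges.items():
        degree[partition[a]] += w
        degree[partition[b]] += w
        if partition[a] == partition[b]:
            intra[partition[a]] += w
    return sum(
        intra[c] / m - resolution * (degree[c] / (2 * m)) ** 2 for c in degree
    )


def encode_by_module(patient_codes, partition):
    """One binary feature per module: 1 if the patient has any code in it."""
    modules = sorted(set(partition.values()))
    return {
        pid: [int(any(partition[c] == mod for c in codes)) for mod in modules]
        for pid, codes in patient_codes.items()
    }


# Hypothetical candidate grouping of the four codes into two modules.
partition = {"4280": 0, "4019": 0, "25000": 1, "5849": 1}

edges = build_edges(patients)
score = modularity(edges, partition)
features = encode_by_module(patients, partition)
```

With this grouping, four code-level binary features collapse to two module-level features per patient, which is the dimensionality reduction the abstract refers to; a detection algorithm would search for the grouping that maximizes the score at each resolution.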


Abstract. In soil sciences, parametric models known as constitutive models (e.g., the Modified Cam Clay and the NorSand) are used to represent the behavior of natural and artificial materials. In contexts where liquefaction may occur, the NorSand constitutive model has been extensively applied by both industry and academia due to its relatively simple critical state formulation and low number of input parameters. Despite its suitability as a modeling framework for assessing static liquefaction, the NorSand model is still based on premises that may not perfectly represent the behavior of all soil types. This motivates the creation of data-driven, physically informed metamodels. The literature suggests that data-driven models should initially be developed using synthetic datasets to establish a general framework, which can later be applied to experimental datasets to enhance the model's robustness and aid in discovering potential mechanisms of soil behavior. Therefore, creating large and reliable synthetic datasets is a crucial step in constructing data-driven constitutive models. Here the NorSand model is useful: by using NorSand simulations as the training dataset, data-driven constitutive metamodels can then be fine-tuned using real test results. Models created in this way combine the power of NorSand with the flexibility of data-driven approaches, enhancing the modeling capabilities for liquefaction. For a material following the NorSand model, the present paper therefore provides a first-of-its-kind database that addresses the size and complexity issues of creating synthetic datasets for nonlinear constitutive modeling of soils by simulating both drained and undrained triaxial tests.
Two datasets are provided: the first one considers a nested Latin hypercube sampling of input parameters encompassing 2000 soil types, each subjected to 40 initial test configurations, resulting in a total of 160 000 triaxial test results. The second one considers nested quasi-Monte Carlo sampling techniques (Sobol and Halton) of input parameters encompassing 2048 soil types, each subjected to 42 initial test configurations, resulting in a total of 172 032 triaxial test results. By using the quasi-Monte Carlo dataset and 49 of its subsamples, it is shown that the dataset of 2000 soil types and 40 initial test configurations is sufficient to represent the general behavior of the NorSand model. In this process, four machine learning algorithms (Ridge Regressor, KNeighbors Regressor and two variants of the Ridge Regressor which incorporate nonlinear Nystroem kernel mappings of the input and output values) were trained to predict the constitutive and test parameters based solely on the triaxial test results. These algorithms achieved 13.91 % and 16.18 % mean absolute percentage errors among all 14 predicted parameters for undrained and drained triaxial test inputs, respectively. As a secondary outcome, this work introduces a Python script that links the established Visual Basic implementation of NorSand to the Python environment. This enables researchers to leverage the comprehensive capabilities of Python packages in their analyses related to this constitutive model.
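
The nested sampling design and the dataset sizes quoted above can be sketched as follows. This is a minimal standard-library illustration, not the authors' script: the function names, the toy dimensions, and the reading of "nested" as one inner design of test configurations per soil type are assumptions, and samples stay in the unit cube rather than being scaled to physical parameter ranges. The factor of 2 for drained and undrained tests is likewise an assumption, chosen because it reproduces both reported totals.

```python
import random


def latin_hypercube(n_samples, n_dims, rng):
    """Points in [0, 1)^n_dims with exactly one point per stratum in each dimension."""
    columns = []
    for _ in range(n_dims):
        # One random point inside each of n_samples equal-width strata,
        # then shuffle so strata are paired at random across dimensions.
        column = [(i + rng.random()) / n_samples for i in range(n_samples)]
        rng.shuffle(column)
        columns.append(column)
    return [tuple(column[i] for column in columns) for i in range(n_samples)]


def nested_design(n_soils, n_configs, n_soil_dims, n_config_dims, seed=0):
    """Outer design over soil types; a fresh inner design of test configurations for each."""
    rng = random.Random(seed)
    soils = latin_hypercube(n_soils, n_soil_dims, rng)
    return [
        (soil, config)
        for soil in soils
        for config in latin_hypercube(n_configs, n_config_dims, rng)
    ]


# Toy sizes and dimensions (hypothetical, far smaller than the published datasets).
design = nested_design(n_soils=5, n_configs=4, n_soil_dims=3, n_config_dims=2)

# Assumed: every (soil, configuration) pair is simulated once drained and once undrained.
DRAINAGE_CONDITIONS = 2
lhs_total = 2000 * 40 * DRAINAGE_CONDITIONS   # Latin hypercube dataset
qmc_total = 2048 * 42 * DRAINAGE_CONDITIONS   # quasi-Monte Carlo dataset
```

Under that assumption, `lhs_total` and `qmc_total` equal the 160 000 and 172 032 test results reported in the abstract.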


Articles published on Nonlinear Kernel

Prediction of fishbone linear instability in tokamaks with machine learning methods

Random Fourier features based nonlinear recurrent kernel normalized LMS algorithm with multiple feedbacks

Extraordinarily Time- and Memory-Efficient Large-Scale Canonical Correlation Analysis in Fourier Domain: From Shallow to Deep.

Sub-sampling graph neural networks for genomic prediction of quantitative phenotypes.

Thermomechanical interaction in a living tissue due to variable thermal loading with memory

Resting-state frontal electroencephalography (EEG) biomarkers for detecting the severity of chronic neuropathic pain

Bayesian inference of the spatial distribution of steel corrosion in reinforced concrete structures using corrosion-induced crack width

Adaptive Sparse Regular Split Gaussian Kernel Least Mean Square Algorithm for Super-Low-Frequency Motion-Induced Noise Cancellation

Effectiveness of nonlinear kernel with memory for a functionally graded solid with size dependency

Comparative Temporal Analysis of SVM-Based Machine Learning Techniques for Bank Risk Assessment

Existence results, regularity and compactness properties, in the $$\alpha $$-norm, for semilinear partial functional integrodifferential equations with nonlinear Kernel and delay argument

Intuitionistic fuzzy twin proximal SVM with fuzzy hyperplane and its application in EEG signal classification

Using Data Science Tools to Reveal and Understand Subtle Relationships of Inhibitor Structure in Frontal Ring-Opening Metathesis Polymerization.

Connectome embedding in multidimensional graph spaces

SAR target recognition through adaptive kernel sparse representation model based on local contrast perception

Accounting for nonlinear responses to traits improves range shift predictions

“Using network analysis modularity to group health code systems and decrease dimensionality in machine learning models”

Non-linear Kernel Optimisation of Support Vector Machine Algorithm for Online Marketplace Sentiment Analysis

Fast, accurate, and interpretable decoding of electrocorticographic signals using dynamic mode decomposition

NorSand4AI: a comprehensive triaxial test simulation database for NorSand constitutive model materials
