Real-world Application Domains Research Articles

We study the suitability of cost-sensitive ordinal artificial intelligence-machine learning (AI-ML) strategies for the prognosis of SARS-CoV-2 pneumonia severity. This was an observational, retrospective, longitudinal cohort study in 4 hospitals in Spain. Information on demographic and clinical status was supplemented by socioeconomic data and air pollution exposures. We proposed AI-ML algorithms for ordinal classification via ordinal decomposition and for cost-sensitive learning via resampling techniques. For performance-based model selection, we defined a custom score that includes per-class sensitivities and asymmetric misprognosis costs. We evaluated 260 distinct AI-ML models via 10 repetitions of 5×5 nested cross-validation with hyperparameter tuning. Model selection was followed by calibration of the predicted probabilities. Final overall performance was compared against five well-established clinical severity scores and against a 'standard' (non-cost-sensitive, non-ordinal) AI-ML baseline. For our best model, we also evaluated explainability with respect to each of the input variables. The study enrolled n = 1548 patients: 712 experienced low, 238 medium, and 598 high clinical severity. We collected d = 131 variables, which became d' = 148 features after categorical encoding. Model selection resulted in a best-performing AI-ML pipeline with: a) no imputation of missing data, b) no feature selection (i.e. using the full set of d' features), c) 'Ordered Partitions' ordinal decomposition, d) cost-based reimbalance, and e) a Histogram-based Gradient Boosting classifier. This best model (calibrated) obtained a median accuracy of 68.1% [67.3%, 68.8%] (95% confidence interval), a balanced accuracy of 57.0% [55.6%, 57.9%], and an overall area under the curve (AUC) of 0.802 [0.795, 0.808]. In our dataset, it outperformed all five clinical severity scores and the 'standard' AI-ML baseline.
We conducted an exhaustive exploration of AI-ML methods designed for both ordinal and cost-sensitive classification, motivated by a real-world application domain (clinical severity prognosis) in which these topics arise naturally. Our model with the best classification performance successfully exploited the ordering information of the ground-truth classes while coping with imbalance and asymmetric costs. However, these ordinal and cost-sensitive aspects are seldom explored in the literature.
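
The ordinal decomposition and the ingredients of the custom score described above can be illustrated with a minimal Python sketch. It assumes that 'Ordered Partitions' refers to the common threshold decomposition, in which a K-class ordinal problem becomes K-1 binary problems of the form "is severity greater than k?". All function names and the cost matrix COST are hypothetical, chosen only for illustration; the abstract does not state the actual cost values or how sensitivities and costs are combined into a single score, so the sketch reports them separately.

```python
from typing import List, Sequence

def ordered_partition_targets(y: Sequence[int], n_classes: int) -> List[List[int]]:
    """Build one binary target vector [y > k] per threshold k = 0..n_classes-2."""
    return [[1 if label > k else 0 for label in y] for k in range(n_classes - 1)]

def class_probabilities(p_greater: Sequence[float]) -> List[float]:
    """Recombine P(y > k), k = 0..K-2, into P(y = k), k = 0..K-1.

    The binary models are trained independently, so their outputs are
    clipped to be non-increasing before taking differences.
    """
    bounds = [1.0]
    for p in p_greater:
        bounds.append(min(bounds[-1], max(0.0, p)))
    bounds.append(0.0)
    return [bounds[k] - bounds[k + 1] for k in range(len(bounds) - 1)]

def per_class_sensitivity(y_true: Sequence[int], y_pred: Sequence[int],
                          n_classes: int) -> List[float]:
    """Fraction of each true class that was predicted correctly (recall)."""
    result = []
    for c in range(n_classes):
        preds = [p for t, p in zip(y_true, y_pred) if t == c]
        result.append(sum(p == c for p in preds) / len(preds) if preds else float("nan"))
    return result

def mean_misprognosis_cost(y_true: Sequence[int], y_pred: Sequence[int],
                           cost: Sequence[Sequence[float]]) -> float:
    """Average cost per patient; cost[t][p] is the cost of predicting p when truth is t."""
    return sum(cost[t][p] for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical asymmetric costs (rows: true class, columns: predicted class).
# Under-predicting severity is assumed to cost more than over-predicting it.
COST = [
    [0, 1, 2],
    [2, 0, 1],
    [4, 2, 0],
]

if __name__ == "__main__":
    y_true = [0, 0, 1, 2, 2, 2]   # 0 = low, 1 = medium, 2 = high severity
    y_pred = [0, 1, 1, 2, 1, 0]
    print(ordered_partition_targets(y_true, 3))
    print(class_probabilities([0.7, 0.2]))
    sens = per_class_sensitivity(y_true, y_pred, 3)
    print(sens, sum(sens) / len(sens))  # the mean of the sensitivities is the balanced accuracy
    print(mean_misprognosis_cost(y_true, y_pred, COST))
```

Because the cost matrix is asymmetric, predicting "low" for a truly "high" patient contributes four times as much to the mean cost as predicting "medium" for a truly "low" one, which is the kind of penalty that plain accuracy ignores.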

Since gas turbines play a key role in electric power generation, the safety and reliability requirements on this classical thermal system are becoming increasingly strict. As large amounts of renewable energy are integrated into the power grid, the demand for deep peak-load regulation, needed to satisfy varying user demand and maintain the stability of the whole grid, leads to more unstable working conditions for gas turbines. Startup, shutdown, and load fluctuation dominate the operating conditions of gas turbines. Hence, simulating and analyzing the dynamic behavior of the engines under such unstable working conditions is important for improving their design, operation, and maintenance. However, conventional dynamic simulation methods based on physical differential equations are unable to handle the uncertainty and noise of varying real-world operations. Although data-driven simulation methods can mitigate the problem to some extent, they cannot perform simulations when data are insufficient. To tackle this issue, a novel transfer learning framework is proposed that transfers knowledge from the physics equation domain to the real-world application domain to compensate for the lack of data. A strongly dynamic operating data set with steep-slope signals is created from physics equations, and a feature similarity-based learning model with an encoder and a decoder is then built and trained to achieve feature-adaptive knowledge transfer. Compared with the baseline model, simulation accuracy increases by 24.6% and prediction error is reduced by 63.6%. Moreover, compared with other classical transfer learning methods, the proposed method has the best simulation performance on the field testing data set.
Furthermore, a study of the effect of the hyperparameters indicates that the proposed method is able to adaptively balance the weight given to knowledge learned from the physical theory domain and from the real-world operation domain.
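
To make the idea of balancing the two knowledge sources concrete, here is a minimal Python sketch of one plausible training objective. It is not the authors' formulation: the abstract does not give the loss, so the function names, the squared-distance feature-similarity term, and the weights w_field and w_align are all assumptions made for illustration. The objective combines the prediction error on physics-simulated data, the prediction error on field data, and a penalty on the distance between encoder features from the two domains.

```python
from typing import Sequence

def mse(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean squared error between two equally long sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

def transfer_loss(pred_sim: Sequence[float], true_sim: Sequence[float],
                  pred_field: Sequence[float], true_field: Sequence[float],
                  feat_sim: Sequence[float], feat_field: Sequence[float],
                  w_field: float = 0.5, w_align: float = 0.1) -> float:
    """Hypothetical combined objective for physics-to-field transfer.

    w_field trades off fitting the physics-simulated data (source domain)
    against fitting the field data (target domain); w_align weights a
    feature-similarity penalty that pulls the encoder features of the
    two domains together.
    """
    source_error = mse(pred_sim, true_sim)     # fit to physics-equation data
    target_error = mse(pred_field, true_field) # fit to scarce field data
    alignment = mse(feat_sim, feat_field)      # feature-similarity penalty
    return (1.0 - w_field) * source_error + w_field * target_error + w_align * alignment

if __name__ == "__main__":
    loss = transfer_loss(
        pred_sim=[1.0, 2.0], true_sim=[1.0, 4.0],
        pred_field=[0.0], true_field=[1.0],
        feat_sim=[0.0, 0.0], feat_field=[1.0, 1.0],
    )
    print(loss)
```

In this sketch, setting w_field close to 0 relies almost entirely on the physics-derived data, while values close to 1 rely on field measurements; a fixed weight like this is only a stand-in for the adaptive balancing that the abstract reports.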

Articles published on Real-world Application Domains

Cost-sensitive ordinal classification methods to predict SARS-CoV-2 pneumonia severity.

Assisted design of data science pipelines

Finding Subgraphs with Maximum Total Density and Limited Overlap in Weighted Hypergraphs

Feature extraction and representation learning of 3D point cloud data

Coupled Attention Networks for Multivariate Time Series Anomaly Detection

The dynamical study and analysis of diverse bright-dark and breathers wave solutions of nonlinear evolution equations and their applications

Anthropogenic Object Localization: Evaluation of Broad-Area High-Resolution Imagery Scans Using Deep Learning in Overhead Imagery.

Clustered Task-Aware Meta-Learning by Learning From Learning Paths.

Synthesizing credit data using autoencoders and generative adversarial networks

CEEMD-MultiRocket: Integrating CEEMD with Improved MultiRocket for Time Series Classification

A memetic algorithm for finding multiple subgraphs that optimally cover an input network.

Group-preserving label-specific feature selection for multi-label learning

DORIAN in action

Efficient Coalition Structure Generation via Approximately Equivalent Induced Subgraph Games.

Data Science and Analytics: An Overview from Data-Driven Smart Computing, Decision-Making and Applications Perspective.

Machine Learning: Algorithms, Real-World Applications and Research Directions.

Dynamic simulation of gas turbines via feature similarity-based transfer learning

Visual interpretation of regression error

Mobile Apps as Personal Assistant Agents: the JaCa-Android Framework for programming Agents-based applications on mobile devices

A novel validation framework to enhance deep learning models in time-series forecasting

