Sparse Datasets Research Articles

Accurate and detailed data are vital for fundamental understanding of turbulent combustion. However, studies of turbulent combustion often suffer from measurement sparsity or high simulation cost. In the present work, for the first time, a physics-informed neural networks (PINNs) framework is established for three-dimensional high-resolution reconstruction of turbulent combustion with synthetic sparse data. The performance of the PINNs is evaluated on two configurations of turbulent flames, without and with a mean shear: freely propagating planar premixed combustion and slot-jet premixed combustion. The reconstructed fields of velocity, temperature, and species mass fractions are compared with high-fidelity direct numerical simulation (DNS) data, and both qualitative and quantitative analyses are performed to evaluate model performance. The results show that, when constrained by the residuals of the governing equations and the limited information in the sparse data, the proposed PINNs can recover the majority of flow and flame structures in a physics-informed way, even when noise has been added to the sparse data. The effectiveness of the models can be influenced by factors such as the size of the sparse dataset, the noise level, and the complexity of turbulence/flame interactions. These factors should be carefully evaluated and addressed to ensure reliable predictions. Overall, this study highlights the potential of PINNs for data assimilation and provides new insights for the development of physics-informed methods in combustion research.

Novelty and significance

In the present work, a physics-informed neural networks (PINNs) framework for turbulent combustion was established, and the feasibility of using PINNs for high-resolution reconstruction of various field data in turbulent combustion was explored for the first time. The velocity, temperature, and species mass fractions were simultaneously reconstructed from limited pointwise sparse data and compared with high-fidelity direct numerical simulation data, showing good agreement. The proposed PINNs are able to recover the flow and flame structures of turbulent flames without and with mean shear layers. The effects of the size of the sparse dataset, the noise level, and the complexity of turbulence/flame interactions on model performance were also assessed. The study highlights the potential of PINNs for data assimilation and provides new insights for the development of physics-informed methods in combustion research.
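
The idea of constraining a model with both sparse data and governing-equation residuals can be illustrated with a toy composite loss. This is a minimal sketch under loud assumptions, not the authors' implementation: the governing equation is replaced by the one-dimensional toy equation du/dx + u = 0, a hypothetical two-parameter surrogate stands in for the neural network, and the names `surrogate`, `composite_loss`, and `residual_weight` are invented for illustration.

```python
import math

# Toy stand-in for the governing equations: du/dx + u = 0, exact solution exp(-x).
# ASSUMPTION: a two-parameter surrogate u(x) = a * exp(b * x) replaces the neural
# network so that the derivative can be written by hand instead of via autodiff.

def surrogate(x, params):
    a, b = params
    return a * math.exp(b * x)

def surrogate_dx(x, params):
    a, b = params
    return a * b * math.exp(b * x)

def composite_loss(params, sparse_x, sparse_u, colloc_x, residual_weight=1.0):
    """Mean squared data misfit plus weighted mean squared equation residual."""
    # Data term: mismatch at the few locations where measurements exist.
    data_term = sum(
        (surrogate(x, params) - u) ** 2 for x, u in zip(sparse_x, sparse_u)
    ) / len(sparse_x)
    # Physics term: residual of du/dx + u = 0 at collocation points without data.
    residual_term = sum(
        (surrogate_dx(x, params) + surrogate(x, params)) ** 2 for x in colloc_x
    ) / len(colloc_x)
    return data_term + residual_weight * residual_term

# Two sparse "measurements" and eleven collocation points on [0, 1].
sparse_x = [0.0, 1.0]
sparse_u = [math.exp(-x) for x in sparse_x]
colloc_x = [i / 10 for i in range(11)]

loss_exact = composite_loss((1.0, -1.0), sparse_x, sparse_u, colloc_x)
loss_wrong = composite_loss((1.0, -0.5), sparse_x, sparse_u, colloc_x)
print(loss_exact, loss_wrong)
```

The exact solution drives both terms to zero, while a wrong decay rate is penalized by the data term and by the residual term at points where no data exist. That second penalty is what lets a physics-informed model fill in structure between sparse samples.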

Accurately predicting patient outcomes is crucial for improving healthcare delivery, but large-scale risk prediction models are often developed and tested on specific datasets where clinical parameters and outcomes may not fully reflect local clinical settings. Where this is the case, whether to opt for de-novo training of prediction models on local datasets, direct porting of externally trained models, or a transfer learning approach is not well studied, and constitutes the focus of this study. Using the clinical challenge of predicting mortality and hospital length of stay on a Danish trauma dataset, we hypothesized that a transfer learning approach of models trained on large external datasets would provide optimal prediction results compared to de-novo training on sparse but local datasets or directly porting externally trained models. Using an external dataset of trauma patients from the US Trauma Quality Improvement Program (TQIP) and a local dataset aggregated from the Danish Trauma Database (DTD) enriched with Electronic Health Record data, we tested a range of model-level approaches focused on predicting trauma mortality and hospital length of stay on DTD data. Modeling approaches included de-novo training of models on DTD data, direct porting of models trained on TQIP data to the DTD, and a transfer learning approach by training a model on TQIP data with subsequent transfer and retraining on DTD data. Furthermore, data-level approaches, including mixed dataset training and methods countering imbalanced outcomes (e.g., low mortality rates), were also tested. Using a neural network trained on a mixed dataset consisting of a subset of TQIP and DTD, with class weighting and transfer learning (retraining on DTD), we achieved excellent results in predicting mortality, with a ROC-AUC of 0.988 and an F2-score of 0.866. 
The best-performing models for predicting long-term hospitalization were trained only on local data, achieving a ROC-AUC of 0.890 and an F1-score of 0.897, although these were only marginally better than alternative approaches. Our results suggest that when assessing the optimal modeling approach, it is important to have domain knowledge of how incidence rates and workflows compare between the hospital systems and datasets on which models are trained. Including data from other healthcare systems is particularly beneficial when outcomes suffer from class imbalance and low incidence. Scenarios where outcomes are not directly comparable are best addressed through either de-novo local training or a transfer learning approach.
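
The F1- and F2-scores quoted above are both instances of the F-beta score, and class weighting is one common way to counter a low-incidence outcome such as mortality. The following is a minimal sketch, not the study's code: the confusion counts are made up, the helper names are hypothetical, and the inverse-frequency weighting rule is an assumption, since the abstract does not say which weighting scheme was used.

```python
from collections import Counter

def fbeta_from_counts(tp, fp, fn, beta):
    """F-beta score from confusion counts; beta > 1 weights recall more heavily."""
    b2 = beta * beta
    denom = (1 + b2) * tp + b2 * fn + fp
    return (1 + b2) * tp / denom if denom else 0.0

def inverse_frequency_weights(labels):
    """ASSUMED scheme: weight = n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}

# Illustrative counts only: 8 true positives, 4 false positives, 1 false negative.
f1 = fbeta_from_counts(tp=8, fp=4, fn=1, beta=1)
f2 = fbeta_from_counts(tp=8, fp=4, fn=1, beta=2)

# Illustrative labels only: 9 survivors (0) and 1 death (1).
weights = inverse_frequency_weights([0] * 9 + [1])
print(f1, f2, weights)
```

With few false negatives and more false positives, F2 comes out higher than F1 because it rewards recall, which is why F2 suits outcomes where a missed positive is costlier than a false alarm. The weighting rule assigns the rare class a proportionally larger weight in the training loss.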

Articles published on Sparse Datasets

GrapHiC: An integrative graph based approach for imputing missing Hi-C reads.

Extended Features Based Random Vector Functional Link Network for Classification Problem

Assessment of Domestic Water Resources for Sustainable Utilization Using Geospatial Techniques. The Case of Pune City, India

Pragmatic degradation learning for scene text image super-resolution with data-training strategy

A Comprehensive Evaluation of Generalizability of Deep Learning-Based Hi-C Resolution Improvement Methods.

Improving Landslide Prediction: Innovative Modeling and Evaluation of Landslide Scenario with Knowledge Graph Embedding

Identifying accurate artefact morphological ranges using optimal linear estimation: Method validation, case studies, and code

High-resolution reconstruction of turbulent flames from sparse data with physics-informed neural networks

Communities in C. elegans connectome through the prism of non-backtracking walks

Unprecedented distribution data for Joshua trees (Yucca brevifolia and Y. jaegeriana) reveal contemporary climate associations of a Mojave Desert icon

Multi-hop path reasoning over sparse temporal knowledge graphs based on path completion and reward shaping

Assessing the potential of synthetic and ex situ airborne laser scanning and ground plot data to train forest biomass models

Solving the imbalanced data issue: automatic urgency detection for instructor assistance in MOOC discussion forums

Performance Analysis of Sketching Methods

ScFed: federated learning for cell type classification with scRNA-seq.

Sparse convolutional neural network for high-resolution skull shape completion and shape super-resolution

Stochastic Gradient Descent for matrix completion: Hybrid parallelization on shared- and distributed-memory systems

Pixel-Level Degradation for Text Image Super-Resolution and Recognition

Language, artificial education, and future-making in indigenous language education

Assessing optimal methods for transferring machine learning models to low-volume and imbalanced clinical datasets: experiences from predicting outcomes of Danish trauma patients.
