Real-world Knowledge Research Articles

The cost of health care in many countries is increasing rapidly. There is a growing interest in using machine learning for predicting high health care utilizers for population health initiatives. Previous studies have focused on individuals who contribute to the highest financial burden. However, this group is small and represents a limited opportunity for long-term cost reduction. We developed a collection of models that predict future health care utilization at various thresholds. We utilized data from a multi-institutional diabetes database from the year 2019 to develop binary classification models. These models predict health care utilization in the subsequent year across 6 different outcomes: patients having a length of stay of ≥7, ≥14, and ≥30 days and emergency department attendance of ≥3, ≥5, and ≥10 visits. To address class imbalance, random and synthetic minority oversampling techniques were employed. The models were then applied to unseen data from 2020 and 2021 to predict health care utilization in the following year. A portfolio of performance metrics, with priority on area under the receiver operating characteristic curve, sensitivity, and positive predictive value, was used for comparison. Explainability analyses were conducted on the best performing models. When trained with random oversampling, 4 models (logistic regression, multivariate adaptive regression splines, boosted trees, and multilayer perceptron) consistently achieved high area under the receiver operating characteristic curve (>0.80) and sensitivity (>0.60) across training-validation and test data sets. Correcting for class imbalance proved critical for model performance. Important predictors for all outcomes included age, number of emergency department visits in the present year, chronic kidney disease stage, inpatient bed days in the present year, and mean hemoglobin A1c levels.
Explainability analyses using partial dependence plots demonstrated that for the best performing models, the learned patterns were consistent with real-world knowledge, thereby supporting the validity of the models. We successfully developed machine learning models capable of predicting high service level utilization with strong performance and valid explainability. These models can be integrated into wider diabetes-related population health initiatives.
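The abstract above names random oversampling and three priority metrics (area under the receiver operating characteristic curve, sensitivity, and positive predictive value) without showing how they are computed. The following is a minimal standard-library sketch of those ideas, not the authors' pipeline: the function names, the 0/1 label encoding, and the fixed seed are all assumptions made for illustration.

```python
import random


def random_oversample(rows, labels, seed=0):
    """Duplicate minority-class rows at random until both classes are equal in size.

    Assumes binary labels encoded as 0 and 1 (an illustrative convention).
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) < len(neg) else (neg, pos)
    if not minority:
        # Nothing to resample from; return unchanged copies.
        return list(rows), list(labels)
    # Sample with replacement from the minority class to close the gap.
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    keep = list(range(len(labels))) + extra
    return [rows[i] for i in keep], [labels[i] for i in keep]


def sensitivity_and_ppv(y_true, y_pred):
    """Sensitivity = TP / (TP + FN); positive predictive value = TP / (TP + FP)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    ppv = tp / (tp + fp) if tp + fp else 0.0
    return sensitivity, ppv


def auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

In a workflow like the one described, oversampling would normally be applied to the training split only, so that the later-year test data keep their natural class balance and the reported sensitivity and positive predictive value reflect real prevalence.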

The integrity and reliability of clinical research outcomes rely heavily on access to vast amounts of data. However, the fragmented distribution of these data across multiple institutions, along with ethical and regulatory barriers, presents significant challenges to accessing relevant data. While federated learning offers a promising solution to leverage insights from fragmented data sets, its adoption faces hurdles due to implementation complexities, scalability issues, and inclusivity challenges. This paper introduces Federated Learning for Everyone (FL4E), an accessible framework facilitating multistakeholder collaboration in clinical research. It focuses on simplifying federated learning through an innovative ecosystem-based approach. The "degree of federation" is a fundamental concept of FL4E, allowing for flexible integration of federated and centralized learning models. This feature provides a customizable solution by enabling users to choose the level of data decentralization based on specific health care settings or project needs, making federated learning more adaptable and efficient. By using an ecosystem-based collaborative learning strategy, FL4E encourages a comprehensive platform for managing real-world data, enhancing collaboration and knowledge sharing among its stakeholders. Evaluating FL4E's effectiveness using real-world health care data sets has highlighted its ecosystem-oriented and inclusive design. By applying hybrid models to 2 distinct analytical tasks (classification and survival analysis) within real-world settings, we have effectively measured the "degree of federation" across various contexts. These evaluations show that FL4E's hybrid models not only match the performance of fully federated models but also avoid the substantial overhead usually linked with these models. Achieving this balance greatly enhances collaborative initiatives and broadens the scope of analytical possibilities within the ecosystem.
FL4E represents a significant step forward in collaborative clinical research by merging the benefits of centralized and federated learning. Its modular ecosystem-based design and the "degree of federation" feature make it an inclusive, customizable framework suitable for a wide array of clinical research scenarios, promising to revolutionize the field through improved collaboration and data use. Detailed implementation and analyses are available on the associated GitHub repository.
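To make the idea of mixing centralized and federated sites concrete, here is a toy sketch in which some sites pool their raw records while the rest share only aggregate statistics. It is not FL4E's API or algorithm: the function names are hypothetical, the statistic being estimated (a simple mean) is chosen only because it is easy to verify, and defining the "degree of federation" as the fraction of sites that keep data local is an assumption made for this example.

```python
def local_summary(values):
    """What a federated site shares: aggregate statistics only, never raw records."""
    return sum(values), len(values)


def degree_of_federation(sites, federated_ids):
    """Fraction of sites that keep their data local (illustrative definition only)."""
    return len(federated_ids) / len(sites)


def hybrid_mean(sites, federated_ids):
    """Estimate a global mean when some sites pool raw data and others stay federated.

    sites: dict mapping a site name to its list of numeric values.
    federated_ids: set of site names that share only (sum, count).
    """
    # Centralized part: raw records from non-federated sites are pooled together.
    pooled = [v for name, vals in sites.items() if name not in federated_ids for v in vals]
    summaries = [local_summary(pooled)] if pooled else []
    # Federated part: each remaining site contributes only its own summary.
    summaries += [local_summary(sites[name]) for name in sorted(federated_ids)]
    total = sum(s for s, _ in summaries)
    count = sum(n for _, n in summaries)
    return total / count


# Hypothetical data: three sites, two of which keep their records local.
sites = {"site_a": [1.0, 2.0, 3.0], "site_b": [4.0, 5.0], "site_c": [6.0]}
federated = {"site_b", "site_c"}
estimate = hybrid_mean(sites, federated)
share_federated = degree_of_federation(sites, federated)
```

For a mean, the hybrid estimate equals the fully pooled one because sums and counts are sufficient statistics. Iteratively trained models, such as the classification and survival models mentioned above, generally do not have this exact-equivalence property, so this sketch shows only the data-flow pattern.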

Articles published on Real-world Knowledge

The role of alternative education to Students’ holistic learning: A case of Tanzanian schools in Morogoro

SSQ[formula omitted]: A Subgraph-based Semantic Query Approach for Temporal Knowledge Graph

Job Opportunities in Various Field for Criminology Graduates: Indian Perspective

Machine Learning-Based Prediction for High Health Care Utilizers by Using a Multi-Institutional Diabetes Registry: Model Training and Evaluation.

A Statistical Analysis of Knowledge Graph and its Applications

Machine Translation Meta Evaluation through Translation Accuracy Challenge Sets

Defining College Student Financial Literacy Utilizing the Delphi Method

Template-Based Contrastive Distillation Pretraining for Math Word Problem Solving.

Manufacturing resilience through disruption mitigation using attention-based consistently-attributed graph embedded decision support system

Accessible Ecosystem for Clinical Research (Federated Learning for Everyone): Development and Usability Study.

Language models, like humans, show content effects on reasoning tasks.

GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph Alignment via Neighborhood Partitioning and Generative Subgraph Encoding

Fuser: An enhanced multimodal fusion framework with congruent reinforced perceptron for hateful memes detection

A comparison of chain-of-thought reasoning strategies across datasets and models

Zero-Shot Construction of Chinese Medical Knowledge Graph with GPT-3.5-turbo and GPT-4

Bridging Gaps: Pre-Service Mathematics Teachers’ Handling the Difficulties in Posing Real-World Mathematical Problems

Multi-Filter soft shrinkage network for knowledge graph embedding

What drives the automatic retrieval of real-world object size knowledge?

Anchoring Path for Inductive Relation Prediction in Knowledge Graphs

Spatiotemporal knowledge graph completion via diachronic and transregional word embedding
