Machine Learning Models Research Articles

A large language model (LLM) is a machine learning model inferred from text data that captures subtle patterns of language use in context. Modern LLMs are based on neural network architectures that incorporate transformer methods. They allow the model to relate words together through attention to multiple words in a text sequence. LLMs have been shown to be highly effective for a range of tasks in natural language processing (NLP), including classification and information extraction tasks and generative applications. The aim of this adapted Delphi study was to collect researchers' opinions on how LLMs might influence health care and on the strengths, weaknesses, opportunities, and threats of LLM use in health care. We invited researchers in the fields of health informatics, nursing informatics, and medical NLP to share their opinions on LLM use in health care. We started the first round with open questions based on our strengths, weaknesses, opportunities, and threats framework. In the second and third round, the participants scored these items. The first, second, and third rounds had 28, 23, and 21 participants, respectively. Almost all participants (26/28, 93% in round 1 and 20/21, 95% in round 3) were affiliated with academic institutions. Agreement was reached on 103 items related to use cases, benefits, risks, reliability, adoption aspects, and the future of LLMs in health care. Participants offered several use cases, including supporting clinical tasks, documentation tasks, and medical research and education, and agreed that LLM-based systems will act as health assistants for patient education. The agreed-upon benefits included increased efficiency in data handling and extraction, improved automation of processes, improved quality of health care services and overall health outcomes, provision of personalized care, accelerated diagnosis and treatment processes, and improved interaction between patients and health care professionals. In total, 5 risks to health care in general were identified: cybersecurity breaches, the potential for patient misinformation, ethical concerns, the likelihood of biased decision-making, and the risk associated with inaccurate communication. Overconfidence in LLM-based systems was recognized as a risk to the medical profession. The 6 agreed-upon privacy risks included the use of unregulated cloud services that compromise data security, exposure of sensitive patient data, breaches of confidentiality, fraudulent use of information, vulnerabilities in data storage and communication, and inappropriate access or use of patient data. Future research related to LLMs should not only focus on testing their possibilities for NLP-related tasks but also consider the workflows the models could contribute to and the requirements regarding quality, integration, and regulations needed for successful implementation in practice.

BackgroundRadiation induced acute skin toxicity (AST) is considered as a common side effect of breast radiation therapy. The goal of this study was to design dosiomics-based machine learning (ML) models for prediction of AST, to enable creating optimized treatment plans for high-risk individuals.MethodsDosiomics features extracted using Pyradiomics tool (v3.0.1), along with treatment plan-derived dose volume histograms (DVHs), and patient-specific treatment-related (PTR) data of breast cancer patients were used for modeling. Clinical scoring was done using the Common Terminology Criteria for Adverse Events (CTCAE) V4.0 criteria for skin-specific symptoms. The 52 breast cancer patients were grouped into AST 2 + (CTCAE ≥ 2) and AST 2 − (CTCAE < 2) toxicity grades to facilitate AST modeling. They were randomly divided into training (70%) and testing (30%) cohorts. Multiple prediction models were assessed through multivariate analysis, incorporating different combinations of feature groups (dosiomics, DVH, and PTR) individually and collectively. In total, seven unique combinations, along with seven classification algorithms, were considered after feature selection. The performance of each model was evaluated on the test group using the area under the receiver operating characteristic curve (AUC) and f1-score. Accuracy, precision, and recall of each model were also studied. Statistical analysis involved features differences between AST 2 − and AST 2 + groups and cutoff value calculations.ResultsResults showed that 44% of the patients developed AST 2 + after Tomotherapy. The dosiomics (DOS) model, developed using dosiomics features, exhibited a noteworthy improvement in AUC (up to 0.78), when spatial information is preserved in the dose distribution, compared to DVH features (up to 0.71). Furthermore, a baseline ML model created using only PTR features for comparison with DOS models showed the significance of dosiomics in early AST prediction. By employing the Extra Tree (ET) classifiers, the DOS + DVH + PTR model achieved a statistically significant improved performance in terms of AUC (0.83; 95% CI 0.71–0.90), accuracy (0.70), precision (0.74) and sensitivity (0.72) compared to other models.ConclusionsThis study confirmed the benefit of dosiomics-based ML in the prediction of AST. However, the combination of dosiomics, DVH, and PTR yields significant improvement in AST prediction. The results of this study provide the opportunity for timely interventions to prevent the occurrence of radiation induced AST.

Machine Learning Models Research Articles

Related Topics

Articles published on Machine Learning Models

Improving second-order Møller-Plesset perturbation theory for noncovalent interactions with the machine learning-corrected abinitio dispersion potential.

Potential of Large Language Models in Health Care: Delphi Study.

Effect of Dynamic and Preferential Decoration of Pt Catalyst Surfaces by WOx on Hydrodeoxygenation Reactions.

Establishing a machine learning model based on dual-energy CT enterography to evaluate Crohn’s disease activity

Combining spectrum, thermal, and texture features using machine learning algorithms for wheat nitrogen nutrient index estimation and model transferability analysis

Evaluating the Performance and Challenges of Machine Learning Models in Network Anomaly Detection

Downscaling Future Precipitation over Mi Oya River Basin using Artificial Neural Networks

Fast Exploring Literature by Language Machine Learning for Perovskite Solar Cell Materials Design

Deep convolutional neural network based synthetic minority over sampling technique: a forfending model for fraudulent credit card transactions in financial institution

A dosiomics model for prediction of radiation-induced acute skin toxicity in breast cancer patients: machine learning-based study for a closed bore linac

Improving Targeted Mass Spectrometry Data Analysis with Nested Active Machine Learning

Unlocking the complete blood count as a risk stratification tool for breast cancer using machine learning: a large scale retrospective study

Drug Burden Index is a Modifiable Predictor of 30-Day-Hospitalization in Community-Dwelling Older Adults with Complex Care Needs: Machine Learning Analysis of InterRAI Data.

Enhanced Bearing Fault Diagnosis Through Trees Ensemble Method and Feature Importance Analysis

Advanced tree-based machine learning methods for predicting the seismic response of regular and irregular RC frames

Improve robustness of machine learning via efficient optimization and conformal prediction

Reproducible Radiomics Features from Multi-MRI-Scanner Test-Retest-Study: Influence on Performance and Generalizability of Models.

Automatic evolutionary design of quantum rule-based systems and applications to quantum reinforcement learning

Ab Initio Design of Ni‐Rich Cathode Material with Assistance of Machine Learning for High Energy Lithium‐Ion Batteries

Hybrid feature selection in a machine learning predictive model for perioperative myocardial injury in noncoronary cardiac surgery with cardiopulmonary bypass.

Lead the way for us