Visual characteristics of the built environment affect how people perceive and experience cities, and researchers have long examined visual perception in urban settings. Such efforts have accelerated in recent years with advances in technology and the proliferation of relevant data (e.g., street view imagery, geo-tagged photos, videos, virtual reality, and aerial imagery). However, no comprehensive systematic review has yet revealed the overarching research trends, limitations, and future research opportunities on this topic, plausibly because of the difficulty of reviewing the large number of relevant papers. In this study, we used machine learning techniques (i.e., natural language processing and large language models) to semi-automate the review process and reviewed 393 relevant papers. We found that these papers can be categorized by the physical aspect of cities they examine: greenery and water, street design, building design, landscape, public space, and the city as a whole. We also revealed that many studies conducted quantitative analyses, with a recent trend of increasingly utilizing big data and advanced technologies, such as combining street view imagery with deep learning models. We identified the following limitations and research gaps: (1) limited scope in terms of study areas, sample sizes, and attributes; (2) low quality of subjective and visual data; and (3) the need for more controlled and sophisticated methods to isolate the impacts of visual features on human perception. We suggest that future studies utilize and contribute to open data and take advantage of existing data and technologies to examine the causal effects of visual features on human perception. The semi-automated approach developed to accelerate this review proved accurate, efficient, and insightful; given its novelty, we also describe it to enable future replication.