Electronic Health Record Information Research Articles

Health-related social needs (HRSNs), such as housing instability, food insecurity, and financial strain, are increasingly prevalent among patients. Healthcare organizations must first correctly identify patients with HRSNs to refer them to appropriate services or offer resources to address their HRSNs. Yet, current identification methods are suboptimal, inconsistently applied, and cost prohibitive. Machine learning (ML) predictive modeling applied to existing data sources may be a solution to systematically and effectively identify patients with HRSNs. The performance of ML predictive models using data from electronic health records (EHRs) and other sources has not been compared to other methods of identifying patients needing HRSN services. A screening questionnaire that included housing instability, food insecurity, transportation barriers, legal issues, and financial strain was administered to adult ED patients at a large safety-net hospital in the mid-Western United States (n = 1,101). We identified those patients likely in need of HRSN-related services within the next 30 days using positive indications from referrals, encounters, scheduling data, orders, or clinical notes. We built an XGBoost classification algorithm using responses from the screening questionnaire to predict HRSN needs (screening questionnaire model). Additionally, we extracted features from the past 12 months of existing EHR, administrative, and health information exchange data for the survey respondents. We built ML predictive models with these EHR data using XGBoost (ML EHR model). Out of concerns of potential bias, we built both the screening question model and the ML EHR model with and without demographic features. Models were assessed on the validation set using sensitivity, specificity, and Area Under the Curve (AUC) values. Models were compared using the Delong test. Almost half (41%) of the patients had a positive indicator for a likely HRSN service need within the next 30 days, as identified through referrals, encounters, scheduling data, orders, or clinical notes. The screening question model had suboptimal performance, with an AUC = 0.580 (95%CI = 0.546, 0.611). Including gender and age resulted in higher performance in the screening question model (AUC = 0.640; 95%CI = 0.609, 0.672). The ML EHR models had higher performance. Without including age and gender, the ML EHR model had an AUC = 0.765 (95%CI = 0.737, 0.792). Adding age and gender did not improve the model (AUC = 0.722; 95%CI = 0.744, 0.800). The screening questionnaire models indicated bias with the highest performance for White non-Hispanic patients. The performance of the ML EHR-based model also differed by race and ethnicity. ML predictive models leveraging several robust EHR data sources outperformed models using screening questions only. Nevertheless, all models indicated biases. Additional work is needed to design predictive models for effectively identifying all patients with HRSNs.

Read full abstract

Long COVID is a debilitating multisystem condition. The objective of this study was to estimate the prevalence of long COVID in the adult population of Scotland, and to identify risk factors associated with its development. In this national, retrospective, observational cohort study, we analysed electronic health records (EHRs) for all adults (≥18 years) registered with a general medical practice and resident in Scotland between March 1, 2020, and October 26, 2022 (98-99% of the population). We linked data from primary care, secondary care, laboratory testing and prescribing. Four outcome measures were used to identify long COVID: clinical codes, free text in primary care records, free text on sick notes, and a novel operational definition. The operational definition was developed using Poisson regression to identify clinical encounters indicative of long COVID from a sample of negative and positive COVID-19 cases matched on time-varying propensity to test positive for SARS-CoV-2. Possible risk factors for long COVID were identified by stratifying descriptive statistics by long COVID status. Of 4,676,390 participants, 81,219 (1.7%) were identified as having long COVID. Clinical codes identified the fewest cases (n=1,092, 0.02%), followed by free text (n=8,368, 0.2%), sick notes (n=14,469, 0.3%), and the operational definition (n=64,193, 1.4%). There was limited overlap in cases identified by the measures; however, temporal trends and patient characteristics were consistent across measures. Compared with the general population, a higher proportion of people with long COVID were female (65.1% versus 50.4%), aged 38-67 (63.7% versus 48.9%), overweight or obese (45.7% versus 29.4%), had one or more comorbidities (52.7% versus 36.0%), were immunosuppressed (6.9% versus 3.2%), shielding (7.9% versus 3.4%), or hospitalised within 28 days of testing positive (8.8% versus 3.3%%), and had tested positive before Omicron became the dominant variant (44.9% versus 35.9%). The operational definition identified long COVID cases with combinations of clinical encounters (from four symptoms, six investigation types, and seven management strategies) recorded in EHRs within 4-26 weeks of a positive SARS-CoV-2 test. These combinations were significantly (p<0.0001) more prevalent in positive COVID-19 patients than in matched negative controls. In a case-crossover analysis, 16.4% of those identified by the operational definition had similar healthcare patterns recorded before testing positive. The prevalence of long COVID presenting in general practice was estimated to be 0.02-1.7%, depending on the measure used. Due to challenges in diagnosing long COVID and inconsistent recording of information in EHRs, the true prevalence of long COVID is likely to be higher. The operational definition provided a novel approach but relied on a restricted set of symptoms and may misclassify individuals with pre-existing health conditions. Further research is needed to refine and validate this approach. Chief Scientist Office (Scotland), Medical Research Council, and BREATHE.

Read full abstract

Electronic Health Record Information Research Articles

Related Topics

Articles published on Electronic Health Record Information

Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder.

Toward a Computable Phenotype for Determining Eligibility of Lung Cancer Screening Using Electronic Health Records.

The association of distress and depression screening measures and other electronic health record information with adjuvant endocrine therapy persistence.

Comparing the performance of screening surveys versus predictive models in identifying patients in need of health-related social need services in the emergency department.

Optimizing Electronic Health Records for Enhanced Clinical Decision Support and Improved Patient Care through Advanced Data Integration and Machine Learning Algorithms

Perceptions of hospital electronic health record (EHR) training, support, and patient safety by staff position and tenure

The reuse of electronic health records information models in the oncology domain: Studies with the bioframe framework

Prevalence and risk factors for long COVID among adults in Scotland using electronic health records: a national, retrospective, observational cohort study

Mapping the Landscape of Electronic Health Records and Health Information Exchange Through Bibliometric Analysis and Visualization.

Enhancing healthcare decision support through explainable AI models for risk prediction

Securing the Patient’s Breast Cancer Data using Blockchain-based IBE with Deep Learning Model in IoT

420 Computable Phenotyping with “Big Data” as a Foundation for Artificial Intelligence Algorithm Construction: Puberty as a Transdisciplinary Case Example

Abstract 3569: Using AI to automatically process data from unstructured health records of patients with lung cancer

Characterisation and validation of lactation information from structured electronic health records for use in pharmacoepidemiological studies.

Towards Reducing Diagnostic Errors with Interpretable Risk Prediction.

Factors Associated with Online Patient Portal Utilization Experience in an Arkansas Phone Survey.

Trustworthy Data and AI Environments for Clinical Prediction: Application to Crisis-Risk in People with Depression.

Pay-for-performance for appropriate prescribing using routine healthcare data from general practices

Vulnerable Patients with Inflammatory Bowel Disease

EHR-KnowGen: Knowledge-enhanced multimodal learning for disease diagnosis generation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Electronic Health Record Information Research Articles

Related Topics

Articles published on Electronic Health Record Information

Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder.

Toward a Computable Phenotype for Determining Eligibility of Lung Cancer Screening Using Electronic Health Records.

The association of distress and depression screening measures and other electronic health record information with adjuvant endocrine therapy persistence.

Comparing the performance of screening surveys versus predictive models in identifying patients in need of health-related social need services in the emergency department.

Optimizing Electronic Health Records for Enhanced Clinical Decision Support and Improved Patient Care through Advanced Data Integration and Machine Learning Algorithms

Perceptions of hospital electronic health record (EHR) training, support, and patient safety by staff position and tenure

The reuse of electronic health records information models in the oncology domain: Studies with the bioframe framework

Prevalence and risk factors for long COVID among adults in Scotland using electronic health records: a national, retrospective, observational cohort study

Mapping the Landscape of Electronic Health Records and Health Information Exchange Through Bibliometric Analysis and Visualization.

Enhancing healthcare decision support through explainable AI models for risk prediction

Securing the Patient’s Breast Cancer Data using Blockchain-based IBE with Deep Learning Model in IoT

420 Computable Phenotyping with “Big Data” as a Foundation for Artificial Intelligence Algorithm Construction: Puberty as a Transdisciplinary Case Example

Abstract 3569: Using AI to automatically process data from unstructured health records of patients with lung cancer

Characterisation and validation of lactation information from structured electronic health records for use in pharmacoepidemiological studies.

Towards Reducing Diagnostic Errors with Interpretable Risk Prediction.

Factors Associated with Online Patient Portal Utilization Experience in an Arkansas Phone Survey.

Trustworthy Data and AI Environments for Clinical Prediction: Application to Crisis-Risk in People with Depression.

Pay-for-performance for appropriate prescribing using routine healthcare data from general practices

Vulnerable Patients with Inflammatory Bowel Disease

EHR-KnowGen: Knowledge-enhanced multimodal learning for disease diagnosis generation