Abstract

Background: There remain many unknowns regarding the natural history, onset, distribution and both the individual and population burden of the ongoing COVID-19 pandemic associated with the spread of the SARS-CoV-2 virus. Here, we used a combination of classic epidemiological methods, natural language processing (NLP), and machine learning (for predictive modelling), to analyse the clinical information in the electronic health records (EHRs) of patients with COVID-19. This approach holds the potential to better define the disease and its associated outcomes, most notably ICU admission. Methods: This is a multicentre, non-interventional, retrospective study using the unstructured free-text clinical information captured in the EHRs of the participating hospital sites within the SESCAM Healthcare Network (Castilla La-Mancha, Spain, with 2.035 M inhabitants). We collected clinical information from the entire population with available EHRs (1,364,924 patients) for the period comprised between January 1, 2020 and March 29, 2020. Following identification of all COVID-19 cases seen in hospitals and primary care settings (all departments), we extracted related information upon diagnosis (including demographic characteristics, symptoms upon diagnosis, and other clinical information) and during disease progression and outcome (admission, discharge, and ICU admission). A data-driven analysis explored the minimum set of clinical variables associated with requiring ICU admission. Findings: A total of 10,504 patients with a clinical or PCR-confirmed diagnosis of COVID-19 were identified, 52.5% males, with a mean±SD age of 58.2±19.7 years, and age distribution ranging from 39oC/102oF without respiratory crackles), were free of ICU admission. On the contrary, COVID-19 patients aged 40 to 79 years were likely to be admitted to the ICU if they had tachypnoea and delayed their visit to the ER after being seen in primary care. Interpretation: Our results show that a combination of easily obtained clinical variables (age, fever, and tachypnoea with/without respiratory crackles) predicts which COVID-19 patients require ICU admission. Funding Statement: This study was sponsored by SAVANA (https://www.savanamed.com/) Declaration of Interests: None to declare Ethics Approval Statement: This study was classified as a ‘non-post-authorization study’ (EPA) by the Spanish Agency of Medicines and Health Products (AEMPS), and it was approved by the Research Ethics Committee at the University Hospital of Guadalajara (Spain).

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call