The validity of electronic health data for measuring smoking status: a systematic review and meta-analysis

Md Ashiqul Haque,Muditha Lakmali Bodawatte Gedara,Nathan Nickel,Maxime Turgeon,Lisa M Lix

doi:10.1186/s12911-024-02416-3

Md Ashiqul Haque, Muditha Lakmali Bodawatte Gedara + Show 3 more

Open Access

https://doi.org/10.1186/s12911-024-02416-3

Copy DOI

Abstract

BackgroundSmoking is a risk factor for many chronic diseases. Multiple smoking status ascertainment algorithms have been developed for population-based electronic health databases such as administrative databases and electronic medical records (EMRs). Evidence syntheses of algorithm validation studies have often focused on chronic diseases rather than risk factors. We conducted a systematic review and meta-analysis of smoking status ascertainment algorithms to describe the characteristics and validity of these algorithms.MethodsThe Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines were followed. We searched articles published from 1990 to 2022 in EMBASE, MEDLINE, Scopus, and Web of Science with key terms such as validity, administrative data, electronic health records, smoking, and tobacco use. The extracted information, including article characteristics, algorithm characteristics, and validity measures, was descriptively analyzed. Sources of heterogeneity in validity measures were estimated using a meta-regression model. Risk of bias (ROB) in the reviewed articles was assessed using the Quality Assessment of Diagnostic Accuracy Studies-2 tool.ResultsThe initial search yielded 2086 articles; 57 were selected for review and 116 algorithms were identified. Almost three-quarters (71.6%) of algorithms were based on EMR data. The algorithms were primarily constructed using diagnosis codes for smoking-related conditions, although prescription medication codes for smoking treatments were also adopted. About half of the algorithms were developed using machine-learning models. The pooled estimates of positive predictive value, sensitivity, and specificity were 0.843, 0.672, and 0.918 respectively. Algorithm sensitivity and specificity were highly variable and ranged from 3 to 100% and 36 to 100%, respectively. Model-based algorithms had significantly greater sensitivity (p = 0.006) than rule-based algorithms. Algorithms for EMR data had higher sensitivity than algorithms for administrative data (p = 0.001). The ROB was low in most of the articles (76.3%) that underwent the assessment.ConclusionsMultiple algorithms using different data sources and methods have been proposed to ascertain smoking status in electronic health data. Many algorithms had low sensitivity and positive predictive value, but the data source influenced their validity. Algorithms based on machine-learning models for multiple linked data sources have improved validity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The validity of electronic health data for measuring smoking status: a systematic review and meta-analysis

Abstract

Talk to us

Similar Papers

More From: BMC medical informatics and decision making

Lead the way for us

Journal: BMC medical informatics and decision making	Publication Date: Feb 2, 2024
License type: CC BY 4.0

Similar Papers

Chronic Disease Case Definitions for Electronic Medical Records: A Canadian Validation Study
Lisa Lix ... Alan Katz
International Journal of Population Data Science | VOL. 1
Lisa Lix, et. al.Lisa Lix ... Alan Katz
18 Apr 2017
International Journal of Population Data Science | VOL. 1

A Systematic Review and Scoping Analysis of Smoking Cessation after a Urological Cancer Diagnosis
Calvin Zhao ... Michael Rink
Journal of Urology | VOL. 205
Calvin Zhao, et. al.Calvin Zhao ... Michael Rink
12 Feb 2021
Journal of Urology | VOL. 205

Can Linked Electronic Medical Record and Administrative Data Help Us Identify Those Living with Frailty?
Sabrina Wong ... Tyler Williamson
International journal of population data science | VOL. 5
Sabrina Wong, et. al.Sabrina Wong ... Tyler Williamson
14 Oct 2020
International journal of population data science | VOL. 5

Illustrating the patient journey through the care continuum: Leveraging structured primary care electronic medical record (EMR) data in Ontario, Canada using chronic obstructive pulmonary disease as a case study
Jennifer Rayner ... Chen Wu
International Journal of Medical Informatics | VOL. 140
Jennifer Rayner, et. al.Jennifer Rayner ... Chen Wu
19 May 2020
International Journal of Medical Informatics | VOL. 140

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The validity of electronic health data for measuring smoking status: a systematic review and meta-analysis

Abstract

Talk to us

Similar Papers

More From: BMC medical informatics and decision making