Abstract
Introduction
Researchers conducting studies based on electronic health records (EHRs) often have to deal with missing data. We aimed to analyze patterns of missing data in lipid profile, sociodemographic variables and risk factors contained in the EHRs of the CARDIABETES project and compare different strategies for addressing the issue.
Methods
We conducted a retrospective cohort study of people with diabetes, based on EHRs in the Spanish Pharmacoepidemiological Research Database for Public Health Systems (BIFAP). Our response variable was major adverse cardiovascular events (MACE), including all-cause death and hospital admission for cerebrovascular disease or ischemic heart disease. We analyzed patterns of missing data, associations between missingness and MACE, and the effect of eliminating cases with missing data or imputing missing data.
Results
Our total sample included 309,556 people with diabetes. The proportion of individuals with at least one missing value was 76.0%. Regarding diabetes control measures, 10.8% of records had missing glycated hemoglobin values, and 21.4% had missing basal blood glucose values. We observed a non-random pattern of association between missingness and MACE. The strategy of eliminating records with missing data greatly reduced the number of cases and statistical power, and altered the average participant characteristics and cumulative incidence of MACE. By imputing missing data, we were able to circumvent these problems.
Conclusion
A considerable proportion of missing data was observed for variables such as fasting blood glucose and glycated hemoglobin, and also for other variables such as blood test parameters, BMI, and tobacco and alcohol use. The missing data show a non-random pattern and are associated with a higher incidence of MACE. The strategy of eliminating records with missing data greatly reduced the number of cases and statistical power.
The recommended solution is to impute missing data with methods that take all the variables into account, such as multiple imputation by chained equations (MICE) with predictive mean matching (PMM).
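To illustrate the idea behind predictive mean matching, the following is a minimal sketch of a single PMM step for one incomplete variable, and not the procedure used in the study. It makes several simplifying assumptions: only one predictor is used, the regression coefficients are not redrawn from their posterior distribution as a full PMM implementation would do, and there is no cycling over several incomplete variables as in MICE. The function names (`fit_line`, `pmm_impute`), the number of donors `k`, and the simulated age and glycated hemoglobin values are all hypothetical.

```python
import random
import statistics


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x with a single predictor."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def pmm_impute(xs, ys, k=5, rng=None):
    """Fill the None entries of ys by predictive mean matching on xs.

    Simplified sketch: each missing value is replaced by the observed
    value of a donor drawn at random from the k observed cases whose
    predicted means are closest to the predicted mean of the missing case.
    """
    rng = rng or random.Random(0)
    observed = [(x, y) for x, y in zip(xs, ys) if y is not None]
    a, b = fit_line([x for x, _ in observed], [y for _, y in observed])
    # Pair each donor's predicted mean with its actually observed value.
    donors = [(a + b * x, y) for x, y in observed]
    completed = []
    for x, y in zip(xs, ys):
        if y is not None:
            completed.append(y)  # observed values are left untouched
            continue
        target = a + b * x
        nearest = sorted(donors, key=lambda d: abs(d[0] - target))[:k]
        completed.append(rng.choice(nearest)[1])
    return completed


# Simulated (hypothetical) data: age predicts glycated hemoglobin.
rng = random.Random(42)
age = [rng.uniform(40, 85) for _ in range(200)]
hba1c = [5.5 + 0.03 * a + rng.gauss(0, 0.5) for a in age]
# Non-random missingness: older people are more likely to lack a value.
hba1c_obs = [None if rng.random() < (a - 40) / 90 else v
             for a, v in zip(age, hba1c)]

completed = pmm_impute(age, hba1c_obs, k=5, rng=rng)
n_missing = sum(v is None for v in hba1c_obs)
print(f"imputed {n_missing} of {len(hba1c_obs)} values")
```

Because every imputed value is borrowed from an observed case, the completed variable stays within the range of real measurements, which is one reason PMM is often preferred for skewed clinical variables. In practice, an established implementation that repeats this step across all incomplete variables and produces several imputed datasets would be used instead of this sketch.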