Abstract
PurposeTo describe the prevalence of missing sociodemographic data in the IRIS® Registry (Intelligent Research for Sight) and to identify practice-level characteristics associated with missing sociodemographic data. DesignCross-sectional study ParticipantsAll patients with clinical encounters at practices participating in the IRIS Registry prior to 12/31/2020. MethodsWe describe geographic and temporal trends in the prevalence of missing data for each sociodemographic variable (age, sex, race, ethnicity, geographic location, insurance type, and smoking status). Each practice contributing data to the registry was categorized based on the number of patients, number of physicians, geographic location, patient visit frequency, and patient population demographics. Main outcome measureMultivariable linear regression was used to describe the association of practice-level characteristics with missing patient-level sociodemographic data. ResultsThis study included the electronic health records of 66,477,365 patients receiving care at 3306 practices participating in the IRIS Registry. The median number of patients per practice was 11,415 (IQR: 5,849-24,148) and the median number of physicians per practice was 3 (IQR: 1-7). The prevalence of missing patient sociodemographic data was 0.1% for age, 0.4% for sex, 24.8% for race, 30.2% for ethnicity, 2.3% for 3-digit zip code, 14.8% for state, 5.5% for smoking status, and 17.0% for insurance type. The prevalence of missing data increased over time and varied at the state-level. Missing race data was associated with practices that had fewer visits per patient (p<0.001), cared for a larger non-privately insured patient population (p=0.001), and were located in urban areas (p<0.001). Frequent patient visits were associated with a lower prevalence of missing race (p<0.001), ethnicity (p<0.001), and insurance (p<0.001), but a higher prevalence of missing smoking status (p<0.001). ConclusionsThere are geographic and temporal trends in missing race, ethnicity, and insurance type data in the IRIS Registry. Several practice-level characteristics, including practice size, geographic location, and patient population are associated with missing sociodemographic data. While the prevalence and patterns of missing data may change in future versions of the IRIS registry, there will remain a need to develop standardized approaches for minimizing potential sources of bias and ensure reproducibility across research studies.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have