Factors Associated with Missing Sociodemographic Data in the IRIS Registry

Ophthalmology Science(2024)

引用 0|浏览2
暂无评分
摘要
Purpose To describe the prevalence of missing sociodemographic data in the IRIS® Registry (Intelligent Research for Sight) and to identify practice-level characteristics associated with missing sociodemographic data. Design Cross-sectional study Participants All patients with clinical encounters at practices participating in the IRIS Registry prior to 12/31/2020. Methods We describe geographic and temporal trends in the prevalence of missing data for each sociodemographic variable (age, sex, race, ethnicity, geographic location, insurance type, and smoking status). Each practice contributing data to the registry was categorized based on the number of patients, number of physicians, geographic location, patient visit frequency, and patient population demographics. Main outcome measure Multivariable linear regression was used to describe the association of practice-level characteristics with missing patient-level sociodemographic data. Results This study included the electronic health records of 66,477,365 patients receiving care at 3306 practices participating in the IRIS Registry. The median number of patients per practice was 11,415 (IQR: 5,849-24,148) and the median number of physicians per practice was 3 (IQR: 1-7). The prevalence of missing patient sociodemographic data was 0.1% for age, 0.4% for sex, 24.8% for race, 30.2% for ethnicity, 2.3% for 3-digit zip code, 14.8% for state, 5.5% for smoking status, and 17.0% for insurance type. The prevalence of missing data increased over time and varied at the state-level. Missing race data was associated with practices that had fewer visits per patient (p<0.001), cared for a larger non-privately insured patient population (p=0.001), and were located in urban areas (p<0.001). Frequent patient visits were associated with a lower prevalence of missing race (p<0.001), ethnicity (p<0.001), and insurance (p<0.001), but a higher prevalence of missing smoking status (p<0.001). Conclusions There are geographic and temporal trends in missing race, ethnicity, and insurance type data in the IRIS Registry. Several practice-level characteristics, including practice size, geographic location, and patient population are associated with missing sociodemographic data. While the prevalence and patterns of missing data may change in future versions of the IRIS registry, there will remain a need to develop standardized approaches for minimizing potential sources of bias and ensure reproducibility across research studies.
更多
查看译文
关键词
IRIS registry,missing data,imputation,race,ethnicity,insurance,geographic location,sociodemographic,electronic health records
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要