Score tests with incomplete covariates and high-dimensional auxiliary variables

Kin Yau Wong, Jiahui Feng

STATISTICA SINICA(2023)

引用 0|浏览2
暂无评分
摘要
Analyses of modern biomedical data are often complicated by missing values. When variables of interest are missing for some subjects, it is desirable to use observed auxiliary variables, which are sometimes high dimensional, to impute or predict the missing values in order to improve the statistical efficiency. Although many methods have been developed for prediction using high-dimensional variables, it is challenging to perform a valid inference based on such predicted values. In this study, we develop an association test for an outcome variable and a potentially missing covariate, where the covariate can be predicted using variables selected from a set of high-dimensional auxiliary variables. We establish the validity of the test under data-driven model-selection procedures. We also demonstrate the validity of the proposed method and its advantages over existing methods using extensive simulation studies and an application to a major cancer genomics study.
更多
查看译文
关键词
Association test,integrative analysis,missing data,post-selection inference,variable selection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要