Principal components analysis for right censored data

STATISTICA SINICA(2023)

引用 0|浏览10
暂无评分
摘要
Principal components analysis (PCA) is a common dimension-reduction tool that transforms a set of variables into a linearly uncorrelated set of variables. Standard PCA estimators involve either the eigendecomposition of the estimated covariance matrix or a singular value decomposition of the centered data. However, for right-censored failure time data, estimating the principal components in this way is not straightforward because not all failure times are observed. Standard estimators for the covariance or correlation matrix should not be used in this case, because they require strong assumptions on the form of the joint distribution and on the marginal distributions beyond the final observation time. We present a novel, nonparametric estimator for the covariance of multivariate right-censored failure time data based on the counting processes and corresponding martingales defined by the failure times. We prove that these estimators are consistent and converge to a Gaussian process when properly standardized. We further show that these covariance estimates can be used to estimate a PCA for the martingales and counting processes for the different failure times. The corresponding estimates of the principal directions are consistent and asymptotically normal. We apply this method to data from a clinical trial of patients with pancreatic cancer, and recover a medically valid low-dimensional representation of adverse events.
更多
查看译文
关键词
Competing risks,multivariate survival analysis,principal components analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要