Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers.

Statistics in Medicine (2023)

Abstract
The increased availability of ultrahigh-dimensional biomarker data and the high demand for identifying biomarkers strongly related to survival outcomes have made feature screening methods commonplace in the analysis of cancer genome data. When survival outcomes include the endpoints of overall survival (OS) and time-to-progression (TTP), the two endpoints are typically highly concordant in cancer studies; namely, patients' OS is most likely extended when tumour progression is delayed. Existing screening procedures are often performed on a single survival endpoint only and, by ignoring disease progression, may result in biased selection of features for OS. We propose a novel feature screening method that incorporates TTP information into the selection of important biomarker predictors for more accurate inference of OS subsequent to disease progression. The proposal is based on the rank of the correlation between individual features and the conditional distribution of OS given observations of TTP. Its advantages are its model flexibility, as it requires no marginal model assumption for either endpoint, and its minimal computational cost. Theoretical results establish its ranking consistency, sure screening and false rate control properties. Simulation results demonstrate that the proposed screener leads to more accurate feature selection than a method that does not consider prior observations of disease progression. An application to breast cancer genome data illustrates its practical utility and facilitates disease classification using the selected biomarker predictors.
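To make the general idea of correlation-rank screening concrete, the following is a minimal sketch of a generic marginal screener, not the method proposed in the paper. It ranks features by the absolute Spearman correlation with a fully observed response and keeps the top few. It ignores censoring and makes no use of TTP, so it corresponds more closely to the single-endpoint baseline than to the conditional screener described above. The function name `rank_correlation_screen`, the toy data, and the cutoff of n / log(n) features (a common convention in the sure independence screening literature) are all assumptions made for illustration.

```python
import numpy as np

def rank_correlation_screen(X, y, top_k):
    """Rank features by |Spearman correlation| with y and keep the top_k.

    Generic marginal screening only: it ignores censoring and TTP,
    so it is NOT the conditional screener described in the abstract.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    # Convert each column and the response to ranks
    # (ties are not averaged, which is fine for continuous data).
    rx = X.argsort(axis=0).argsort(axis=0).astype(float)
    ry = y.argsort().argsort().astype(float)
    # Spearman correlation is the Pearson correlation of the ranks.
    rx -= rx.mean(axis=0)
    ry -= ry.mean()
    denom = np.sqrt((rx ** 2).sum(axis=0) * (ry ** 2).sum())
    scores = np.abs(rx.T @ ry) / denom
    # Sort features from most to least associated and keep top_k indices.
    order = np.argsort(-scores)
    return order[:top_k], scores

# Hypothetical toy data: 200 subjects, 1000 features,
# of which only features 0 and 1 drive the (uncensored) outcome.
rng = np.random.default_rng(0)
n, p = 200, 1000
X = rng.normal(size=(n, p))
y = np.exp(1.5 * X[:, 0] - 1.5 * X[:, 1] + 0.5 * rng.normal(size=n))

selected, scores = rank_correlation_screen(X, y, top_k=int(n / np.log(n)))
```

Because the score depends on the data only through ranks, it is unchanged by monotone transformations of the response, which is one reason rank-based screeners need no marginal model for the outcome. Handling censoring and conditioning on TTP, as the paper does, would require a different statistic.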
Keywords
conditional survival function, correlation rank, dependent censoring, ranking consistency, sure independence screening, ultrahigh-dimensional data