Handling missing covariates in observational studies: an illustration with the assessment of prognostic factors of survival outcomes in soft-tissue or visceral sarcomas in irradiated fields (SIF)

THERAPEUTIC ADVANCES IN MEDICAL ONCOLOGY(2024)

引用 0|浏览14
暂无评分
摘要
Background: Missing covariates are common in observational research and can lead to bias and loss of statistical power. Limited data regarding prognostic factors of survival outcomes of sarcomas in irradiated fields (SIF) are available. Because of the long lag time between irradiation of first cancer and scarcity of SIF, missing data are a critical issue when analyzing long-term outcomes. We assessed prognostic factors of overall (OS), progression-free (PFS), and metastatic-progression-free (MPFS) survivals in SIF using three methods to account for missing covariates.Methods: We relied on the NETSARC French Sarcoma Group database, Cox (OS/PFS), and competitive hazards (MPFS) survival models. Covariates investigated were age, sex, histological subtype, tumor size, depth and grade, metastasis, surgery, surgical resection, surgeon's expertise, imaging, and neo-adjuvant treatment. We first applied multiple imputation (MI): observed data were used to estimate the missing covariate. With the missing-data modality approach, a category missing was created for qualitative variables. With the complete-case (CC) approach, analysis was restricted to patients without missing covariates.Results: CC subjects (N = 167; 33%) presented more often with soft-tissue sarcoma (versus visceral sarcoma) and grade I-II tumors as compared to the 504 eligible cases. With MI (N = 504), factors associated with the worst outcome included metastasis (p = 0.04) and R1/R2 resection (p < 0.001) for OS; higher grade/non-gradable tumors (p = 0.002) and R1/R2 resection (p < 0.001) for PFS; and metastasis (p = 0.01) for M-PFS. The 'missing-data modality' approach (N = 504) led to different associations, including significance reached due to variables with the modality 'missing'. The CC analysis led to different results and reduced precision.Conclusion: The CC population was not representative of the eligible population, introducing bias, in addition to worst precision. The 'missing-data modality method' results in biased estimates in non-randomized studies, as outcomes may be related to variables with missing values. Appropriate statistical methods for missing covariates, for example, MI, should therefore be considered.
更多
查看译文
关键词
competing risks,irradiation,sarcoma,survival analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要