Abstract PR10: Multiple Pathway Learning accurately predicts gene essentiality in the Cancer Cell Line Encyclopedia

Cancer Research(2015)

引用 0|浏览0
暂无评分
摘要
We applied biologically-motivated feature transformations coupled with established machine learning methods to predict gene essentiality in CCLE cell line models. By leveraging additional large datasets, such as The Cancer Genome Atlas PanCancer12 data and MSigDB pathway definitions, we improved the robustness and biological interpretability of our models. We developed a multi-pathway learning (MPL) approach that associates a genetic pathway from MSigDB with a distinct kernel for use in a multiple kernel learning setting. We evaluated the performance of MPL compared to several other regression methods including random forests, kernel ridge regression, and elastic net linear models, We combined multiple approaches using an ensemble technique on the diverse set of predictors. We found that the best performing method was an ensemble combining MPL and random forest predictions. Both models utilized features derived from both gene expression and copy number data, the latter of which were filtered to those predicted as driver events in prior pan-cancer studies. The ensemble method was a joint winner in the recent DREAM 9 gene essentiality prediction challenge. MPL also demonstrated merit as a feature selector when used with other downstream methods. The ensemble performed best at predicting the essentiality of genes involved in cell cycle control (cyclins and cyclin-dependent kinases), protein degradation (proteasome complex), cell proliferation signaling (sonic hedgehog, Aurora-B, RAC1), apoptosis (RB1,TP53) and hypoxia response (VEGF, VHL). Many of the key genes in those pathways are known to be drivers of cancer progression, suggesting our method9s utility as a biomarker for detecting key tumorigenic events. The advantage of MPL is that mechanistically coherent gene sets are automatically selected as high scoring pathway kernels (HSPKs). We investigated whether the HSPKs identify cellular processes relevant to the loss of key genes. To do this, we inspected the HSPKs for a few of the most abundantly mutated genes in cancer. The MPL predictor for TP53 included the targets of this transcription factor as well as HSPKs involved in apoptosis, a cellular process regulated by TP53. The retinoblastoma gene (RB1) MPL predictor included RB1 targets as well as HSPKs involved in the regulation of histone deacetylase (HDAC) that interacts with RB1 to suppress DNA synthesis. These findings suggest trends in the MPL results could reveal a pathway-level view of the synthetic lethal architecture of cells. Such a map, that links patterns of pathway expression to potential genetic vulnerabilities, could provide an invaluable tool for exploring new avenues to target cancer cells. Citation Format: Vladislav Uzunangelov, Evan Paull, Sahil Chopra, Daniel Carlin, Adrian Bivol, Kyle Ellrott, Kiley Graim, Yulia Newton, Sam Ng, Artem Sokolov, Joshua Stuart. Multiple Pathway Learning accurately predicts gene essentiality in the Cancer Cell Line Encyclopedia. [abstract]. In: Proceedings of the AACR Special Conference on Translation of the Cancer Genome; Feb 7-9, 2015; San Francisco, CA. Philadelphia (PA): AACR; Cancer Res 2015;75(22 Suppl 1):Abstract nr PR10.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要