谷歌浏览器插件
订阅小程序
在清言上使用

Empowering rare variant burden-based gene-trait association studies via optimized computational predictor choice

bioRxiv (Cold Spring Harbor Laboratory)(2022)

引用 0|浏览14
暂无评分
摘要
Background Causal gene/trait relationships can be identified via observation of an excess (or reduced) burden of rare variation in a given gene within humans who have that trait. Although computational predictors can improve the power of such ‘burden’ tests, it is unclear which are optimal for this task. Method Using 140 gene-trait combinations with a reported rare-variant burden association, we evaluated the ability of 20 computational predictors to predict human traits. We used the best-performing predictors to increase the power of genome-wide rare variant burden scans based on ∼450K UK Biobank participants. Results Two predictors—VARITY and REVEL—outperformed all others in predicting human traits in the UK Biobank from missense variation. Genome-scale burden scans using the two best-performing predictors identified 1,038 gene-trait associations (FDR < 5%), including 567 (55%) that had not been previously reported. We explore 54 cardiovascular gene-trait associations (including 15 not reported in other burden scans) in greater depth. Conclusions Rigorous selection of computational missense variant effect predictors can improve the power of rare-variant burden scans for human gene-trait associations, yielding many new associations with potential value in informing mechanistic understanding and therapeutic development. The strategy we describe here is generalizable to future computational variant effect predictors, traits and organisms. ### Competing Interest Statement F.P.R.is a scientific advisor and shareholder for Constantiam Biosciences and BioSymetrics, and a Ranomics shareholder. The authors declare no other competing interests. * VUS : Variant of uncertain significance SNV : Single-nucleotide variant MAF : Minor allele frequency AUBPRC : Area under the balanced precision-recall curve FDR : False discovery rate PCC : Pearson correlation coefficient FID : Field ID LDL : Low-density lipoprotein FH : Familial hypercholesterolemia CI : Confidence interval BMI : Body mass index HGMD : Human Gene Mutation Database NIH : National Institutes of Health GO : Grand Opportunity NHLBI : National Heart, Lung, and Blood Institute WBBC : Westlate Biobank for Chinese
更多
查看译文
关键词
computational predictor choice,burden-based,gene-trait
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要