Genome-wide prediction of dominant and recessive neurodevelopmental disorder risk genes

bioRxiv (2022)

Abstract
Despite great progress in the identification of neurodevelopmental disorder (NDD) risk genes, thousands remain to be discovered. Computational tools that provide accurate gene-level predictions of NDD risk can significantly reduce the cost and time needed to prioritize and discover novel NDD risk genes. Here, we first demonstrate that machine learning models trained solely on single-cell RNA-sequencing data from the developing human cortex can robustly predict genes implicated in autism spectrum disorder (ASD), developmental and epileptic encephalopathy (DEE), and developmental delay (DD). Strikingly, we find that expression patterns differ between genes with monoallelic and biallelic inheritance patterns. We then integrate these expression data with 300 orthogonal features in a semi-supervised machine learning framework (mantis-ml) to train inheritance-specific models for ASD, DEE, and DD. The models have high predictive power (AUCs: 0.84 to 0.95), and top-ranked genes were up to two-fold (monoallelic models) and six-fold (biallelic models) more enriched for high-confidence NDD risk genes than genic intolerance metrics. Across all models, genes in the top decile of predicted risk genes were 60 to 130 times more likely than genes in the bottom decile to have publications in PubMed strongly linking them to the phenotype of interest. Collectively, this work provides highly robust novel NDD risk gene predictions that can complement large-scale gene discovery efforts and underscores the importance of incorporating inheritance into gene risk prediction tools.

Competing Interest Statement
R.S.D., S.P., D.V., and A.W.Z. are current employees and/or stockholders of AstraZeneca. B.W., J.S.D., A.J.S., and C.S. declare no competing interests.
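The abstract's top-decile versus bottom-decile comparison can be made concrete with a small sketch. This is not the authors' code or the mantis-ml API: the function names, the gene identifiers, the scores, and the set of "positive" genes below are all hypothetical, and the ratio shown is only one simple way to express such an enrichment.

```python
from typing import Dict, Set, Tuple


def decile_rates(
    scores: Dict[str, float], positives: Set[str], n_bins: int = 10
) -> Tuple[float, float]:
    """Share of positive genes in the top and bottom bins of a score ranking.

    `scores` maps gene -> predicted risk score (higher means riskier).
    `positives` is the set of genes counted as hits (hypothetical here).
    """
    if n_bins < 2:
        raise ValueError("n_bins must be at least 2")
    # Rank genes from highest to lowest predicted risk.
    ranked = sorted(scores, key=scores.get, reverse=True)
    # Bin size; with n_bins=10 this is one decile.
    k = max(1, len(ranked) // n_bins)
    top, bottom = ranked[:k], ranked[-k:]
    top_rate = sum(g in positives for g in top) / k
    bottom_rate = sum(g in positives for g in bottom) / k
    return top_rate, bottom_rate


def fold_enrichment(top_rate: float, bottom_rate: float) -> float:
    """Ratio of hit rates; infinite when the bottom bin has no hits."""
    if bottom_rate == 0:
        return float("inf")
    return top_rate / bottom_rate


# Toy, made-up data: 40 genes with strictly decreasing scores.
toy_scores = {f"GENE{i:02d}": 1.0 - i / 40 for i in range(40)}
# Hypothetical positives: three in the top decile, one in the bottom.
toy_positives = {"GENE00", "GENE01", "GENE02", "GENE39"}

top_rate, bottom_rate = decile_rates(toy_scores, toy_positives)
fold = fold_enrichment(top_rate, bottom_rate)
```

In this toy ranking, three of the four top-decile genes and one of the four bottom-decile genes are positives, so the top decile is three times as enriched as the bottom one. The figures reported in the abstract come from the authors' own models and literature searches, which this sketch does not reproduce.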
Key words
genes, genome-wide