Using Machine Learning to identify microRNA biomarkers for predisposition to Huntington’s Disease

K Patel, C Sheridan,DP Shanley

Journal of Bioinformatics and Systems Biology(2022)

引用 0|浏览6
暂无评分
摘要
Background Huntington’s disease (HD) is an autosomal dominant disease which is triggered by a large expansion of CAG nucleotides in the HTT gene. While the CAG expansion linearly correlates with the age of disease onset in HD, twin-studies and cohorts of Juvenile Onset HD (JOHD) patients have shown other factors influence the progression of HD. Thus, it would be of interest to identify molecular biomarkers which indicate predisposition to the development of HD, and as microRNAs (miRNAs) circulate in bio-fluids they would be particularly useful biomarkers. We explored a large HD miRNA-mRNA expression dataset (GSE65776) to establish appropriate questions that could be addressed using Machine Learning (ML). We sought sets of features (mRNAs or miRNAs) to predict HD or WT samples from aged or young mouse cortex samples, and we asked if a set of features could predict predisposition to HD or WT genotypes by training models on aged samples and testing the models on young samples. Several models were created using ADAboost, ExtraTrees, GaussianNB and Random Forest, and the best performing models were further analysed using AUC curves and PCA plots. Finally, genes used to train our miRNA-based predisposition model were mined from HD patient bio-fluid samples. Results Our testing accuracies were between 66-100% and AUC scores were between 31-100%. We generated several excellent models with testing accuracies >80% and AUC scores >90%. We also identified homologues of mmu-miR-154-5p , mmu-miR-181a-5p , mmu-miR-212-3p, mmu-miR-378b, mmu-miR-382-5p and mmu-miR-770-5p from our miRNA-based predisposition model to be circulating in HD patient blood samples at p.values of <0.05. Conclusions We generated several age-based models which could differentiate between HD and WT samples, including an aged mRNA-based model with a 100% AUC score, an aged miRNA-based model with a 92% AUC score and an aged miRNA-based model with a 96% AUC score. We also identified several miRNAs used to train our miRNA-based predisposition model which were detectable in HD patient blood samples, which suggests they could be potential candidates for use as non-invasive biomarkers for HD research. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要