An Approach of Epistasis Detection Using Integer Linear Programming Optimizing Bayesian Network
IEEE/ACM Transactions on Computational Biology and Bioinformatics(2022)
摘要
Proposing a more effective and accurate epistatic loci detection method in large-scale genomic data has important research significance for improving crop quality, disease treatment,
etc
. Due to the characteristics of high accuracy and processing non-linear relationship, Bayesian network (
BN
) has been widely used in constructing the network of SNPs and phenotype traits and thus to mine epistatic loci. However, the shortcoming of
BN
is that it is easy to fall into local optimum and unable to process large-scale of SNPs. In this work, we transform the problem of learning Bayesian network into the optimization of integer linear programming (
ILP
). We use the algorithms of branch-and-bound and cutting planes to get the global optimal Bayesian network (
ILPBN
), and thus to get epistatic loci influencing specific phenotype traits. In order to handle large-scale of SNP loci and further to improve efficiency, we use the method of optimizing Markov blanket to reduce the number of candidate parent nodes for each node. In addition, we use
α-BIC
that is suitable for processing the epistatis mining to calculate the
BN
score. We use four properties of
BN
decomposable scoring functions to further reduce the number of candidate parent sets for each node. Experiment results show that
ILPBN
can not only process 2-locus and 3-locus epistasis mining, but also realize multi-locus epistasis detection. Finally, we compare
ILPBN
with several popular epistasis mining algorithms by using simulated and real Age-related macular disease (AMD) dataset. Experiment results show that
ILPBN
has better epistasis detection accuracy, F1-score and false positive rate in premise of ensuring the efficiency compared with other methods. Availability: Codes and dataset are available at:
http://122.205.95.139/ILPBN/
.
更多查看译文
关键词
Epistasis,integer linear programming,bayesian network,markov blanket,multi-locus
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要