Predicting chemical activities from structures by attributed molecular graph classification

CIBCB (2010)

Abstract
Designing Quantitative Structure-Activity Relationship (QSAR) models has been a recurrent research interest for biologists and computer scientists. An example is predicting the toxicity of chemical compounds from their structural properties, represented as graphs. A popular way to classify these graphs is to combine classifiers such as support vector machines (SVMs) with graph kernels that incorporate the sequential, structural and chemical information. Previous work has focused on designing specific graph kernels for this task, among which graph alignment kernels are one of the most popular approaches. Graph alignment kernels align the nodes of one graph to the nodes of a second graph so that the total similarity is maximized over all possible alignments. However, taking both vertex and edge similarities into account makes the problem NP-hard. In this paper, we present a novel, general graph-matching-based method for QSAR. We view the problem of calculating optimal assignments between two attributed graphs from a different perspective. Instead of first designing an atom kernel function and a bond kernel function, we first provide a training set of pairs of graphs with their corresponding matchings. We then learn the compatibility function over atoms and use only the atom kernel function to compute graph matchings. Our algorithm has the advantage of being more general than previous approaches to the QSAR problem while remaining efficient. We evaluate our method on a set of chemical structure-activity prediction benchmark datasets and show that it achieves accuracy better than or comparable to that of the optimal assignment kernel method.
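To make the idea of matching two molecular graphs using atom similarities alone more concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the paper's method: the `atom_kernel` function, its element and degree features, and its weights are hypothetical and hand-written, whereas the abstract describes learning the compatibility function from training matchings. The brute-force search is only practical for very small graphs; when only node similarities are scored, the same problem is a linear assignment problem and can be solved in polynomial time.

```python
from itertools import permutations


def atom_kernel(a, b):
    """Hypothetical atom similarity (an assumption, not from the paper):
    1.0 for matching elements, plus 0.5 if the heavy-atom degrees agree."""
    score = 1.0 if a["element"] == b["element"] else 0.0
    if a["degree"] == b["degree"]:
        score += 0.5
    return score


def optimal_assignment(atoms_x, atoms_y):
    """Return (best_score, mapping) maximizing the summed atom similarity.

    mapping is a list of (index_in_x, index_in_y) pairs. Every atom of the
    smaller graph is assigned to a distinct atom of the larger graph.
    """
    if len(atoms_x) > len(atoms_y):
        # Solve with the smaller graph first, then flip the index pairs back.
        score, mapping = optimal_assignment(atoms_y, atoms_x)
        return score, [(xi, yi) for yi, xi in mapping]

    best_score, best_map = float("-inf"), []
    # Enumerate every injective assignment of x's atoms onto y's atoms.
    for perm in permutations(range(len(atoms_y)), len(atoms_x)):
        score = sum(atom_kernel(atoms_x[i], atoms_y[j])
                    for i, j in enumerate(perm))
        if score > best_score:
            best_score, best_map = score, list(enumerate(perm))
    return best_score, best_map


# Heavy-atom graphs of methanol (C-O) and ethanol (C-C-O).
methanol = [{"element": "C", "degree": 1}, {"element": "O", "degree": 1}]
ethanol = [{"element": "C", "degree": 1},
           {"element": "C", "degree": 2},
           {"element": "O", "degree": 1}]

score, mapping = optimal_assignment(methanol, ethanol)
print(score, mapping)
```

In this toy example the methanol carbon is paired with the terminal ethanol carbon and the two oxygens are paired with each other. The resulting score could serve as a similarity value between the two molecules for a downstream classifier such as an SVM, although whether it forms a valid kernel is a separate question that this sketch does not address.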
Keywords
chemistry computing,np-hard problem,pattern classification,chemical activity prediction,structural information,attributed molecular graph classification,quantitative structure-activity relationship,graph alignment kernels,chemical information,atom kernel function,graph theory,bond kernel function,graph-matching,sequential information,support vector machines,decision trees,logistics,kernel function,chemical structure,kernel,training data,graph matching,quantitative structure activity relationship,support vector machine,algorithm design and analysis,chemicals,kernel method,np hard problem