Interactive exploration of parameter space in data mining: Comprehending the predictive quality of large decision tree collections.

Computers & Graphics (2014)

Abstract
Decision trees are an intuitive yet powerful tool for predictive data analysis in data mining. To generate an adequate predictive model from a data set, a data analyst has to assess the predictive quality of the decision trees derived from several combinations of working parameters. Except in very simple cases, this can be a tedious and error-prone supervised task, since the parameter space is frequently huge. Analysts therefore rely on their intuition and usually test just a few parameter settings. In this work we present an interactive approach that facilitates comprehension of the predictive power of large collections of decision trees by exploring large portions of the parameter space. To this end, we developed novel views that allow us to visualize and analyze the predictive quality of hundreds of trees. These views work together with coordinated multiple views of tree representations (needed to understand the tree shapes and the information they contain) and with aggregates of Receiver Operating Characteristic (ROC) and lift curves for assessing the predictive quality of the models. We developed a worked example using a data set from a telecommunications company, showing how easy and natural it is to gain insight into the behavior of the data within our exploration tool, compared with the traditional, widespread practice of data analysts.
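To make the two ingredients of the abstract concrete, the following is a minimal sketch, not the authors' tool: it enumerates every combination in a small parameter space and computes two standard quality measures (ROC AUC and lift) for one model's scores. The parameter names (`max_depth`, `min_leaf_size`, `split_criterion`), the helper functions, and the toy labels and scores are all hypothetical and chosen only for illustration; training the trees themselves is left out.

```python
from itertools import product


def parameter_grid(space):
    """Yield one dict per combination of parameter values."""
    names = sorted(space)
    for values in product(*(space[name] for name in names)):
        yield dict(zip(names, values))


def roc_auc(labels, scores):
    """Area under the ROC curve, computed by pairwise comparison.

    Equals the fraction of (positive, negative) pairs in which the positive
    example has the higher score; ties count as half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def lift_at(labels, scores, fraction):
    """Lift in the top `fraction` of examples ranked by score.

    Positive rate among the selected examples divided by the overall
    positive rate.
    """
    base_rate = sum(labels) / len(labels)
    if base_rate == 0:
        raise ValueError("lift is undefined without positive examples")
    k = max(1, round(fraction * len(labels)))
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    top_rate = sum(y for _, y in ranked[:k]) / k
    return top_rate / base_rate


# Hypothetical parameter space: 3 * 2 * 2 = 12 candidate settings.
space = {
    "max_depth": [3, 5, 8],
    "min_leaf_size": [10, 50],
    "split_criterion": ["gini", "entropy"],
}
settings = list(parameter_grid(space))

# Toy labels (1 = positive class) and scores for a single model.
labels = [1, 0, 1, 0, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.35, 0.2, 0.1]

print(len(settings), "parameter settings")
print("AUC:", roc_auc(labels, scores))
print("Lift in top 25%:", lift_at(labels, scores, 0.25))
```

Even this small grid yields a dozen settings, and each added parameter multiplies the count, which illustrates why testing only a few settings by hand covers little of the parameter space.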
Keywords
Decision trees, Parameter space exploration, Visual analytics, Knowledge discovery