ExploreKit: Automatic Feature Generation and Selection

2016 IEEE 16th International Conference on Data Mining (ICDM) (2016)

Abstract
Feature generation is one of the challenging aspects of machine learning. We present ExploreKit, a framework for automated feature generation. ExploreKit generates a large set of candidate features by combining information in the original features, with the aim of maximizing predictive performance according to user-selected criteria. To overcome the exponential growth of the feature space, ExploreKit uses a novel machine learning-based feature selection approach to predict the usefulness of new candidate features. This approach enables efficient identification of the new features and produces superior results compared to existing feature selection solutions. We demonstrate the effectiveness and robustness of our approach by conducting an extensive evaluation on 25 datasets and 3 different classification algorithms. We show that ExploreKit can achieve classification-error reduction of 20% overall.
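
The generate-then-rank idea in the abstract can be illustrated with a minimal sketch. This is not ExploreKit's implementation: the operator set, the function names, and the toy data are all hypothetical, and the scoring step uses absolute Pearson correlation with the label as a simple stand-in for the learned usefulness predictor that the abstract describes.

```python
from itertools import combinations
from statistics import mean, pstdev

# Hypothetical binary operators for combining two numeric columns.
OPERATORS = {
    "add": lambda a, b: a + b,
    "sub": lambda a, b: a - b,
    "mul": lambda a, b: a * b,
}


def generate_candidates(columns):
    """Build one candidate feature per (column pair, operator)."""
    candidates = {}
    for (name_a, col_a), (name_b, col_b) in combinations(columns.items(), 2):
        for op_name, op in OPERATORS.items():
            key = f"{op_name}({name_a},{name_b})"
            candidates[key] = [op(a, b) for a, b in zip(col_a, col_b)]
    return candidates


def abs_correlation(values, labels):
    """Stand-in usefulness score (an assumption, not the paper's method):
    absolute Pearson correlation between a candidate and the label."""
    sd_v, sd_l = pstdev(values), pstdev(labels)
    if sd_v == 0 or sd_l == 0:
        return 0.0  # a constant column carries no signal
    mean_v, mean_l = mean(values), mean(labels)
    cov = mean((v - mean_v) * (l - mean_l) for v, l in zip(values, labels))
    return abs(cov / (sd_v * sd_l))


def rank_candidates(columns, labels, top_k=3):
    """Score every candidate and return the top_k as (name, score) pairs."""
    candidates = generate_candidates(columns)
    scored = [(abs_correlation(vals, labels), name)
              for name, vals in candidates.items()]
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [(name, score) for score, name in scored[:top_k]]


if __name__ == "__main__":
    # Toy data: the label depends on the sign of x1 * x2, which neither
    # original column reveals on its own.
    toy_columns = {"x1": [1, 1, -1, -1], "x2": [1, -1, 1, -1]}
    toy_labels = [1, 0, 0, 1]
    for name, score in rank_candidates(toy_columns, toy_labels):
        print(name, round(score, 3))
```

Scoring every candidate scales with the number of column pairs and operators, which is the growth problem the abstract refers to. A learned predictor of usefulness, as the abstract describes, would take the place of the correlation score here.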
Keywords
automatic feature generation, automatic feature selection, ExploreKit, predictive performance maximization, user-selected criteria, machine learning-based feature selection, classification-error reduction, classification algorithms