Compact NSGA-II for Multi-objective Feature Selection

Sevil Zanjani Miyandoab,Shahryar Rahnamayan, Azam Asilian Bidgoli

IEEE International Conference on Systems, Man and Cybernetics(2024)

引用 0|浏览0
暂无评分
摘要
Feature selection is an expensive challenging task in machine learning and data mining aimed at removing irrelevant and redundant features. This contributes to an improvement in classification accuracy, as well as the budget and memory requirements for classification, or any other post-processing task conducted after feature selection. In this regard, we define feature selection as a multi-objective binary optimization task with the objectives of maximizing classification accuracy and minimizing the number of selected features. In order to select optimal features, we have proposed a binary Compact NSGA-II (CNSGA-II) algorithm. Compactness represents the population as a probability distribution to enhance evolutionary algorithms not only to be more memory-efficient but also to reduce the number of fitness evaluations. Instead of holding two populations during the optimization process, our proposed method uses several Probability Vectors (PVs) to generate new individuals. Each PV efficiently explores a region of the search space to find non-dominated solutions instead of generating candidate solutions from a small population as is the common approach in most evolutionary algorithms. To the best of our knowledge, this is the first compact multi-objective algorithm proposed for feature selection. The reported results for expensive optimization cases with a limited budget on five datasets show that the CNSGA-II performs more efficiently than the well-known NSGA-II method in terms of the hypervolume (HV) performance metric requiring less memory. The proposed method and experimental results are explained and analyzed in detail.
更多
查看译文
关键词
Multi-objective Feature Selection,Classification Accuracy,Optimization Process,Evolutionary Algorithms,Search Space,Multi-objective Optimization,Probability Vector,Redundant Features,Candidate Solutions,Improve Classification Accuracy,Multi-objective Algorithm,Machine Learning Tasks,NSGA-II Algorithm,Population Size,Training Set,Hyperparameters,Step Size,Real-valued,General Solution,Particle Swarm,Pareto Front,Compact Method,Real-world Datasets,Multi-objective Evolutionary Algorithms,Pareto Front Solutions,Premature Convergence,Classification Error,Population Of Solutions,Feature Subset,Large Step Size
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要