Compact NSGA-II for Multi-objective Feature Selection
IEEE International Conference on Systems, Man and Cybernetics(2024)
摘要
Feature selection is an expensive challenging task in machine learning and
data mining aimed at removing irrelevant and redundant features. This
contributes to an improvement in classification accuracy, as well as the budget
and memory requirements for classification, or any other post-processing task
conducted after feature selection. In this regard, we define feature selection
as a multi-objective binary optimization task with the objectives of maximizing
classification accuracy and minimizing the number of selected features. In
order to select optimal features, we have proposed a binary Compact NSGA-II
(CNSGA-II) algorithm. Compactness represents the population as a probability
distribution to enhance evolutionary algorithms not only to be more
memory-efficient but also to reduce the number of fitness evaluations. Instead
of holding two populations during the optimization process, our proposed method
uses several Probability Vectors (PVs) to generate new individuals. Each PV
efficiently explores a region of the search space to find non-dominated
solutions instead of generating candidate solutions from a small population as
is the common approach in most evolutionary algorithms. To the best of our
knowledge, this is the first compact multi-objective algorithm proposed for
feature selection. The reported results for expensive optimization cases with a
limited budget on five datasets show that the CNSGA-II performs more
efficiently than the well-known NSGA-II method in terms of the hypervolume (HV)
performance metric requiring less memory. The proposed method and experimental
results are explained and analyzed in detail.
更多查看译文
关键词
Multi-objective Feature Selection,Classification Accuracy,Optimization Process,Evolutionary Algorithms,Search Space,Multi-objective Optimization,Probability Vector,Redundant Features,Candidate Solutions,Improve Classification Accuracy,Multi-objective Algorithm,Machine Learning Tasks,NSGA-II Algorithm,Population Size,Training Set,Hyperparameters,Step Size,Real-valued,General Solution,Particle Swarm,Pareto Front,Compact Method,Real-world Datasets,Multi-objective Evolutionary Algorithms,Pareto Front Solutions,Premature Convergence,Classification Error,Population Of Solutions,Feature Subset,Large Step Size
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要