Chrome Extension
WeChat Mini Program
Use on ChatGLM

Feature Selection for Unbalanced Distribution Hybrid Data Based on ${k}$-Nearest Neighborhood Rough Set.

Weihua Xu, Ziting Yuan,Zheng Liu

IEEE transactions on artificial intelligence(2024)

Cited 0|Views27
No score
Abstract
Neighborhood rough sets are now widely used to process numerical data. Nevertheless, most of the existing neighborhood rough sets are not able to distinguish class mixture samples well when dealing with classification problems. That is, it cannot effectively classify categories when dealing with data with an unbalanced distribution. Because of this, in this article, we propose a new feature selection method that takes into consideration both heterogeneous data and feature interaction. The proposed model well integrates the ascendancy of ${\delta }$ -neighborhood and ${k}$ -nearest neighbor. Such heterogeneous data can be handled better than existing neighborhood models. We utilize information entropy theories such as mutual information and conditional mutual information and employ an iterative strategy to define the importance of each feature in decision making. Furthermore, we design a feature extraction algorithm based on the above idea. Experimental results display that the raised algorithm has superior effect than some existing algorithms, particularly the ${\delta }$ -neighborhood rough set model and the ${k}$ -nearest neighborhood rough set model.
More
Translated text
Key words
Feature selection,neighborhood mutual information,neighborhood rough set (NRS),unbalanced distribution
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined