Feature selection based on difference and similitude in data mining

Wuhan University Journal of Natural Sciences(2007)

Abstract
Feature selection is a preprocessing step in data mining, and heuristic search algorithms are often used for it. Many of these algorithms are based on discernibility matrices, which consider only the differences in an information system. Because a discernibility matrix does not reveal similar characteristics, the resulting rules may not be the simplest. Although difference-similitude (DS) methods take both difference and similitude into account, the existing search strategy can cause some important features to be ignored. This paper proposes an improved DS-based algorithm to solve this problem. The improved algorithm defines an attribute rank function that considers both difference and similitude in feature selection. Experiments show that the algorithm is effective, especially for large-scale databases. The time complexity of the algorithm is O(|C|^2 |U|^2).
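
To make the distinction between difference and similitude concrete, here is a minimal Python sketch. It is not the paper's algorithm, and the abstract does not define the attribute rank function, so none is reproduced here. The sketch assumes the usual rough-set reading: a difference set records the condition attributes on which two objects with different decisions disagree (the information a discernibility matrix holds), and a similitude set records the condition attributes on which two objects with the same decision agree. The toy table, the attribute names a, b, c, d, and the helper names are all hypothetical.

```python
from itertools import combinations

# Hypothetical toy decision table: condition attributes a, b, c; decision d.
TABLE = [
    {"a": 0, "b": 1, "c": 0, "d": "yes"},
    {"a": 0, "b": 1, "c": 1, "d": "yes"},
    {"a": 1, "b": 0, "c": 0, "d": "no"},
    {"a": 1, "b": 1, "c": 1, "d": "no"},
]
CONDITIONS = ("a", "b", "c")


def difference_and_similitude(table, conditions, decision="d"):
    """Return (difference, similitude) dicts keyed by object-index pairs.

    Assumed definitions (not taken from the abstract):
    - difference[(i, j)]: attributes that differ when decisions differ
    - similitude[(i, j)]: attributes that agree when decisions agree
    """
    difference, similitude = {}, {}
    for i, j in combinations(range(len(table)), 2):
        x, y = table[i], table[j]
        if x[decision] != y[decision]:
            difference[(i, j)] = {c for c in conditions if x[c] != y[c]}
        else:
            similitude[(i, j)] = {c for c in conditions if x[c] == y[c]}
    return difference, similitude


def attribute_counts(sets, conditions):
    """Count how many pairwise sets each attribute appears in."""
    return {c: sum(c in s for s in sets.values()) for c in conditions}


difference, similitude = difference_and_similitude(TABLE, CONDITIONS)
diff_counts = attribute_counts(difference, CONDITIONS)
sim_counts = attribute_counts(similitude, CONDITIONS)
print("difference counts:", diff_counts)
print("similitude counts:", sim_counts)
```

In this toy table, attribute a appears in every difference set and also in every similitude set, which is the kind of information a purely difference-based matrix cannot express. The frequency counts are only an illustration of how both views could feed a ranking; how the paper actually combines them is not stated in the abstract.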
Key words
knowledge reduction, feature selection, rough set, difference set, similitude set, attribute rank function