Optimizing the prototypes with a novel data weighting algorithm for enhancing the classification performance of fuzzy clustering

Fuzzy Sets and Systems(2021)

Cited 8|Views26
No score
Abstract
Fuzzy clustering is regarded as an unsupervised learning process that constitutes a prerequisite for many other data mining techniques. Deciding how to classify data efficiently and accurately has been one of the topics pursued by many researchers. We anticipate that the classification performance of the clustering is strongly dependent on the boundary data (viz. data located at the boundaries of the clusters). The boundary data hold some levels of uncertainties and as such contain more information than others. Usually the greater the uncertainty, the more information contained in such data. To improve the quality of clustering, this study develops an augmented scheme of fuzzy clustering, in which a novel weighted data-based fuzzy clustering is proposed. In the introduced scheme, a dataset is composed of boundary data and non-boundary data. The partition matrix is used to determine the boundary data and the non-boundary data to be next considered in the clustering process. Then, we assign different weights to each datum to construct the weighted data. During this process, we make the weights for the boundary data and the non-boundary data different, which makes the contributions of the boundary data and the non-boundary data to the prototypes being reduced and enhanced, respectively. Furthermore, we build a weighting function to determine the weights of the data. The weighted data are used to optimize the prototypes. With the optimized prototypes, the partition matrix can be refined, which ultimately makes the boundaries of the clusters optimized. Finally, the classification performance of fuzzy clustering is enhanced. We offer a thorough analysis of the developed scheme. Comprehensive experimental studies involving synthetic and publicly available datasets are reported to demonstrate the performance of the proposed approach.
More
Translated text
Key words
Fuzzy C-Means (FCM),Weighted data,Partition matrix,Prototypes,Weighting function
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined