Kernel-Based k-Representatives Algorithm for Fuzzy Clustering of Categorical Data

2021 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), 2021

Abstract
Fuzzy cluster analysis addresses unclear boundaries between clusters in data by grouping objects into fuzzy clusters based on their similarities. In this paper, we propose a new method for fuzzy clustering of data with categorical attributes. Specifically, we first introduce a kernel-based representation of cluster centers in which the underlying distribution of categorical values within a cluster is estimated as a weighted sum of the uniform distribution and the frequency distribution of those values. We then extend the k-centers clustering method by applying this cluster center representation to fuzzy clustering of categorical data. The effectiveness and efficiency of the proposed method are demonstrated through experiments on 16 real-world datasets and comparisons with existing methods. In addition, our research can be regarded as the first attempt to apply a fuzzy silhouette score, which accounts for both the internal coherence and the external separation of fuzzy clusters, to the clustering of categorical data.
Keywords
Fuzzy clustering, Fuzzy silhouette, Categorical data, k-representatives
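
The cluster center representation described in the abstract, a weighted sum of the uniform distribution and the frequency distribution of categorical values, can be illustrated with a minimal Python sketch for a single attribute. This is not the authors' implementation: the function name `kernel_center`, the mixing parameter `lam`, and the optional fuzzy-membership `weights` are all assumptions made for illustration, and the abstract does not say how the mixing weight is chosen.

```python
def kernel_center(values, domain, lam, weights=None):
    """Estimate a distribution over `domain` for one attribute of one cluster.

    Hypothetical sketch: mixes the uniform distribution over `domain`
    with the (optionally weighted) relative frequencies of `values`.
    `lam` in [0, 1] is an assumed mixing weight on the uniform part.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    if weights is None:
        # Crisp case: every object counts equally.
        weights = [1.0] * len(values)
    if len(weights) != len(values):
        raise ValueError("values and weights must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive number")

    # Weighted frequency of each category within the cluster.
    freq = {c: 0.0 for c in domain}
    for v, w in zip(values, weights):
        freq[v] += w

    uniform = 1.0 / len(domain)
    return {c: lam * uniform + (1.0 - lam) * freq[c] / total for c in domain}


# Toy attribute with three possible categories, one of them unobserved.
domain = ["red", "blue", "green"]
values = ["red", "red", "blue"]
center = kernel_center(values, domain, lam=0.3)
```

With `lam=0` the sketch reduces to the plain frequency distribution, and with `lam=1` it reduces to the uniform distribution. For any value in between, a category that never occurs in the cluster (here `"green"`) still receives non-zero probability.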