Document Clustering in Distributed Environment

Yi Peng, Gang Kou, Yong Shi, Zhengxin Chen, Khaled M. Hammouda, Mohamed S. Kamel, Douglass R. Cutting, David R. Karger, Jan O. Pedersen, John W. Tukey, Michael Steinbach, George Karypis, Rafal A. Angryk

semanticscholar(2020)

Abstract
Document clustering has become a widely used technique as large numbers of documents accumulate daily in areas such as newsgroups, government organizations, the Internet, and digital libraries. Document clustering is the process of grouping similar documents into clusters. A good document clustering algorithm should yield high intra-cluster similarity and low inter-cluster similarity; that is, the documents within a cluster should be more relevant to one another than to the documents of other clusters. In this paper, the implementation of document clustering in a distributed environment based on a peer-to-peer network architecture is reviewed. The documents at each local site are clustered using the K-means algorithm. Hierarchical clustering is obtained when clusters in each peer combine to form the