Cluster Analysis of Named Entities
INTELLIGENT INFORMATION PROCESSING AND WEB MINING(2004)
摘要
This paper presents a statistics-based and language independent unsupervised approach for clustering possible named entities. We describe and motivate the features and statistical filters used by our clustering process. Using the Model-Based Clustering Analysis software we obtained different clusters of named entities. The method was applied to Bulgarian and English. For some clusters, precision is close to 100%, this helps human validation and saves time. Other clusters still need further refinement.
更多查看译文
关键词
cluster analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要