
Representations for multi-document event clustering

Data Mining and Knowledge Discovery (2012)

Abstract
We study several techniques for building, fusing and comparing content representations of news documents. As underlying models we consider the vector space model (both in a term setting and in a latent semantic analysis setting) and probabilistic topic models based on latent Dirichlet allocation. Content terms can be classified as topical terms or named entities, yielding several models for content fusion and comparison. All of the methods used are completely unsupervised. We find that simple methods can still outperform the current state-of-the-art techniques.
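
To make the idea of fusing topical-term and named-entity representations concrete, the following is a minimal illustrative sketch in Python, not the authors' implementation. It assumes each document has already been split into topical terms and named entities (the toy data are hand-labeled and hypothetical), builds a separate smoothed TF-IDF vector for each view, and fuses the two cosine similarities with a weight `alpha`. The function names, the smoothing formula, and the linear fusion rule are all assumptions chosen for illustration; the abstract does not specify them.

```python
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per tokenized document."""
    n = len(docs)
    # Document frequency: number of documents in which each term occurs.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed IDF (an assumption, not taken from the paper).
        vectors.append(
            {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        )
    return vectors


def cosine(u, v):
    """Cosine similarity of two sparse vectors; 0.0 if either vector is empty."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def fused_similarity(topic_u, topic_v, entity_u, entity_v, alpha=0.5):
    """Hypothetical late fusion: weighted sum of topical and named-entity similarity."""
    return alpha * cosine(topic_u, topic_v) + (1.0 - alpha) * cosine(entity_u, entity_v)


# Toy corpus (hypothetical): documents 0 and 1 report the same event, document 2 does not.
topical_terms = [
    ["earthquake", "rescue", "damage"],
    ["earthquake", "rescue", "survivors"],
    ["election", "vote", "results"],
]
named_entities = [
    ["Haiti", "Port-au-Prince"],
    ["Haiti", "UN"],
    ["France", "Paris"],
]

topic_vecs = tf_idf_vectors(topical_terms)
entity_vecs = tf_idf_vectors(named_entities)

same_event = fused_similarity(topic_vecs[0], topic_vecs[1], entity_vecs[0], entity_vecs[1])
different_event = fused_similarity(topic_vecs[0], topic_vecs[2], entity_vecs[0], entity_vecs[2])
print(same_event > different_event)
```

A pairwise similarity of this kind could then be handed to any unsupervised clustering routine; swapping the TF-IDF vectors for latent semantic analysis or topic-model representations would change only the vector-building step.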
Keywords
Text mining, Probabilistic content models, Clustering