An Ontology-Based Framework For Automatic Topic Detection In Multilingual Environments

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS (2018)

Abstract
Detecting topics in large volumes of text is an active research area with many applications in the development of computational systems. One proposed solution for topic detection in data mining is the application of clustering methods. This paper presents a new ontology-based methodology for automatic topic detection that requires no prior information and relies on hierarchical clustering algorithms and a multilingual knowledge base. The approach also includes lexical resources that enrich the semantics of the analyzed texts. Its novelty lies in using an ontology to reduce the dimensionality of the terms present in the texts and in introducing a method for creating a term weight matrix for use in clustering algorithms. With this approach, it is possible to improve automatic topic detection in documents. The proposed methodology was assessed with four datasets (two in English and two in Spanish).
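The pipeline outlined in the abstract (mapping terms to ontology concepts, building a term weight matrix, then clustering hierarchically) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `CONCEPTS` dictionary is a hypothetical stand-in for the multilingual ontology, the smoothed TF-IDF weighting is an assumption because the abstract does not specify the weighting method, and average-linkage agglomerative clustering over cosine similarity is only one possible hierarchical algorithm.

```python
import math
from collections import Counter

# Hypothetical term-to-concept map standing in for the multilingual ontology.
CONCEPTS = {
    "football": "sport", "soccer": "sport", "fútbol": "sport", "goal": "sport",
    "bank": "finance", "banco": "finance", "loan": "finance", "préstamo": "finance",
}

def to_concepts(text):
    """Replace each known term with its concept and drop unknown terms."""
    return [CONCEPTS[t] for t in text.lower().split() if t in CONCEPTS]

def weight_matrix(docs):
    """Build a concept weight matrix (rows: documents, columns: concepts).

    Assumption: smoothed TF-IDF; the abstract does not name the weighting.
    """
    bags = [Counter(to_concepts(d)) for d in docs]
    vocab = sorted({c for bag in bags for c in bag})
    n = len(docs)
    df = {c: sum(1 for bag in bags if c in bag) for c in vocab}
    rows = []
    for bag in bags:
        total = sum(bag.values()) or 1  # avoid division by zero for empty docs
        rows.append([
            (bag[c] / total) * (math.log((1 + n) / (1 + df[c])) + 1)
            for c in vocab
        ])
    return vocab, rows

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def agglomerative(rows, k):
    """Merge the most similar pair of clusters (average linkage) until k remain."""
    clusters = [[i] for i in range(len(rows))]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                sims = [cosine(rows[a], rows[b])
                        for a in clusters[i] for b in clusters[j]]
                score = sum(sims) / len(sims)
                if best is None or score > best[0]:
                    best = (score, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters

# Toy corpus mixing English and Spanish documents.
docs = ["football goal soccer", "el fútbol goal", "bank loan", "banco préstamo"]
vocab, rows = weight_matrix(docs)
clusters = agglomerative(rows, 2)
```

In this toy corpus, eight surface terms in two languages collapse to two concept columns, which is the kind of dimensionality reduction the abstract attributes to the ontology. Documents in different languages then become directly comparable in the same matrix.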
Keywords
multilingual topic detection, ontologies, text clustering, text mining