A Novel Text Classification Approach Based on Word2vec and TextRank Keyword Extraction

2019 IEEE Fourth International Conference on Data Science in Cyberspace (DSC)(2019)

引用 5|浏览7
暂无评分
摘要
In the era of big data, a practical challenge is how to extract the keywords and classify the content of the massive text data efficiently. To address this challenge, through extensive research of existing approaches, we propose a novel text classification approach based on TextRank and Word2vec. In our approach, we integrate the external influence of the Word2vec training model and the location-aware weighting influence of the keywords into the Textrank module. Furthermore, we propose a novel text re-classification algorithm based on the combination of the keywords and KNN. The experiment results show that our approach achieves better accuracy and performance compared to TF-IDF and TextRank algorithm.
更多
查看译文
关键词
keyword extraction,text classification,Word2vec,TextRank,KNN,time-based efficiency
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要