A Keyphrase Extraction Method Based on Multi-feature Evaluation and Mask Mechanism

2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS)(2022)

引用 0|浏览7
暂无评分
摘要
Keyphrase extraction aims to identify phrases in documents that contain core content. However, existing unsupervised keyphrase extraction models are limited to focusing on a single feature leading to biased results. In response to the above problems, it evaluates keyphrase scores through multiple features of semantic importance, topic diversity, and position features. Firstly, it masked the candidate keyphrase from a document and the Manhattan distance between the mask document and the original document is calculated as the semantic importance feature. Secondly, it calculated the topic-word distribution of candidate keyphrases as topic diversity, and the position features are calculated. Finally, the phrase importance score is calculated by integrating the three sub-models. Experiments are conducted on three academic datasets and compared with six state-of-the-art baseline models, outperforming existing methods. The results show that evaluating phrase importance from multiple features significantly improves the performance of extracting keyphrases.
更多
查看译文
关键词
keyphrase extraction,multi-feature evaluation,mask mechanism,text mining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要