Retrieval of Weighted Lexicons Based on Supervised Learning Method
Communications in Computer and Information Science (2023)
Abstract
Lexicons are lexical resources that have been used successfully in sentiment analysis and other areas of natural language processing. Although several unweighted and weighted lexicons exist, they all perform poorly in many applications. This is because they are created for general contexts, and adding more terms to an existing lexicon is complicated. Furthermore, current methods for generating weighted lexicons are complex and not very intuitive. In this article, we present the results of a method that generates weighted lexicons from a tagged corpus. The terms that make up the lexicon, together with their weights, are obtained by means of a distance measure that is closely related to the probability that a document belongs to its label. Preliminary results obtained with a corpus of 405 documents show that the method reaches an accuracy of 92.3%.
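The abstract does not specify the distance measure, so the following is only a minimal sketch of the general idea of deriving term weights from a labeled corpus. It assumes a stand-in score, the smoothed estimate of P(label | term) minus the label prior, which is not the measure used in the paper. The function names `build_weighted_lexicon` and `classify`, the `smoothing` parameter, and the toy data are all hypothetical.

```python
from collections import Counter, defaultdict


def build_weighted_lexicon(docs, labels, smoothing=1.0):
    """Return {label: {term: weight}} from tokenized, labeled documents.

    Assumption: weight = smoothed P(label | term) - P(label).
    This is an illustrative score, not the paper's distance measure.
    """
    label_counts = Counter(labels)
    n_docs = len(docs)
    n_labels = len(label_counts)

    # doc_freq[term][label] = number of documents with that label containing the term
    doc_freq = defaultdict(Counter)
    for tokens, label in zip(docs, labels):
        for term in set(tokens):
            doc_freq[term][label] += 1

    lexicon = {label: {} for label in label_counts}
    for term, per_label in doc_freq.items():
        total = sum(per_label.values())
        for label in label_counts:
            # Additive smoothing keeps unseen (term, label) pairs away from zero.
            p_label_given_term = (per_label[label] + smoothing) / (
                total + smoothing * n_labels
            )
            prior = label_counts[label] / n_docs
            lexicon[label][term] = p_label_given_term - prior
    return lexicon


def classify(tokens, lexicon):
    """Assign the label whose lexicon gives the highest summed weight."""
    scores = {
        label: sum(weights.get(token, 0.0) for token in tokens)
        for label, weights in lexicon.items()
    }
    return max(scores, key=scores.get)


# Toy corpus (hypothetical data, for illustration only).
docs = [["good", "movie"], ["great", "good"], ["bad", "movie"], ["awful", "bad"]]
labels = ["pos", "pos", "neg", "neg"]
lexicon = build_weighted_lexicon(docs, labels)
print(classify(["good", "film"], lexicon))
```

In this sketch a term that appears equally often under every label, such as "movie" in the toy corpus, receives a weight of zero, while terms concentrated in one label receive a positive weight for that label and a negative weight for the others.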
Keywords
weighted lexicons, retrieval, supervised learning method