A novel classification approach based on Naïve Bayes for Twitter sentiment analysis.

KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS(2017)

引用 21|浏览8
暂无评分
摘要
With rapid growth of web technology and dissemination of smart devices, social networking service(SNS) is widely used. As a result, huge amount of data are generated from SNS such as Twitter, and sentiment analysis of SNS data is very important for various applications and services. In the existing sentiment analysis based on the Naive Bayes algorithm, a same number of attributes is usually employed to estimate the weight of each class. Moreover, uncountable and meaningless attributes are included. This results in decreased accuracy of sentiment analysis. In this paper two methods are proposed to resolve these issues, which reflect the difference of the number of positive words and negative words in calculating the weights, and eliminate insignificant words in the feature selection step using Multinomial Naive Bayes(MNB) algorithm. Performance comparison demonstrates that the proposed scheme significantly increases the accuracy compared to the existing Multivariate Bernoulli Naive Bayes(BNB) algorithm and MNB scheme.
更多
查看译文
关键词
Twitter sentiment analysis,Machine learning,Naive Bayes,Attribute weighting,Feature selection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要