Sentiment Analysis of Reviews in Natural Language: Roman Urdu as a Case Study

IEEE ACCESS(2022)

引用 10|浏览5
暂无评分
摘要
Opinion Mining from user reviews is an emerging field. Sentiment Analysis of Natural Language text helps us in finding the opinion of the customers. These reviews can be in any language e.g. English, Chinese, Arabic, Japanese, Urdu, and Hindi. This research presents a model to classify the polarity of the review(s) in Roman Urdu text (reviews). For the purpose, raw data was scraped from the reviews of 20 songs from Indo-Pak Music Industry. In this research a new dataset of 24000 reviews of Roman Urdu text is created. Nine Machine Learning algorithms-Naive Bayes, Support Vector Machine, Logistic Regression, K-Nearest Neighbors, Artificial Neural Networks, Convolutional Neural Network, Recurrent Neural Networks, ID3 and Gradient Boost Tree, are attempted. Logistic Regression outperformed the rest, based on testing and cross validation accuracies that are 92.25% and 91.47% respectively.
更多
查看译文
关键词
Sentiment analysis, Videos, Filtration, Annotations, Standards, Licenses, Benchmark testing, Sentiment analysis, sentiment classification, Roman Urdu, supervised learning, song reviews, Roman Urdu corpus, machine learning, Naive Bayes, decision tree, K-NN, deep learning, ANN, CNN, RNN, text classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要