谷歌浏览器插件
订阅小程序
在清言上使用

Integrating Information Gain methods for Feature Selection in Distance Education Sentiment Analysis during Covid-19

TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS(2023)

引用 1|浏览4
暂无评分
摘要
Sentiment analysis is a way to automatically understand and process text data to figure out how someone feels about an opinion sentence. If there are too many reviews, it will take a lot of time and they will start to be biased. Sentiment classification tries to solve this problem by putting user reviews into groups based on whether they are positive, negative, or neutral. The dataset comes from Drone Emprit Academic. It is made up of tweets with the words "online learning method " in them, with as many as 4887 data crawled from them. Information Gain and adaboost on the C4.5 (FS+C4.5) method are used in the feature selection method. We use feature options to get rid of bias and improve accuracy. The results of the experiments will be compared to other algorithms like C4.5 and random forest. Based on the results, the accuracy of the two standard decision tree models (C4.5 and random forest) went up from 48.21% and 50.35% to 94.47 %.The value of how accurate it was went up by 44 percent. The FS+C4.5 model, on the other hand, has an RMSE of 0.204 and a correlation of 0.944. So, adding the feature selection technique to the sentiment analysis of bold learning education can make the C4.5 algorithm even more accurate.
更多
查看译文
关键词
Sentiment Analysis,Random Forest,C4.5,Decision Tree,Twitter
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要