An efficient framework for real-time tweet classification

International Journal of Information Technology (2017)

Abstract
The increasing popularity of social networking sites such as Facebook, Twitter, and Google+ is contributing to the rapid proliferation of big data. Among social networking sites, Twitter is one of the most common sources of big data, where people from across the world share their views on various topics and subjects. With a daily active user count of more than 100 million, Twitter is becoming a rich information source for finding trends and current happenings around the world. Twitter does provide a limited “trends” feature. To make Twitter trends more interesting and informative, in this paper we propose a framework that can analyze Twitter data and classify tweets on a specific subject to generate trends. We illustrate the use of the framework by analyzing tweets in the “Politics” domain as the subject. To classify tweets, we propose a tweet classification algorithm that efficiently classifies the tweets.
Keywords
Big Data, Apache Spark, HDFS, RDDs