N-gram based approach to recognize the twitter accounts of Turkish daily newspapers

2017 International Artificial Intelligence and Data Processing Symposium (IDAP), 2017

Abstract
Twitter is one of the most popular social media networks in the world. It is widely used by companies and media organizations as well as by individual users. Media organizations use Twitter to announce news. Although the language of the news is formal, the words preferred for sharing information differ from one organization to another. In this study, we propose an approach to recognize the Twitter accounts of Turkish daily newspapers. Our approach uses character 3-grams and word 2-grams to represent the texts numerically. For classification, we ran experiments with several classifiers and found that Sequential Minimal Optimization (SMO) outperformed the other algorithms. We carried out the experiments on a real dataset of Twitter accounts of Turkish daily newspapers and classified them with an accuracy of more than 98%.
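To make the feature representation concrete, the following is a minimal sketch of how character 3-grams and word 2-grams could be extracted from a tweet and counted. It is an illustration only, not the authors' implementation: the function names (`char_ngrams`, `word_ngrams`, `featurize`), the `c3:`/`w2:` feature prefixes, whitespace tokenization, and the sample text are all assumptions, and the abstract does not describe any preprocessing or weighting.

```python
from collections import Counter


def char_ngrams(text, n=3):
    """Return the overlapping character n-grams of a string (spaces included)."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def word_ngrams(text, n=2):
    """Return the overlapping word n-grams, splitting on whitespace (an assumption)."""
    tokens = text.split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def featurize(text):
    """Count character 3-grams and word 2-grams in one sparse feature map.

    The 'c3:' and 'w2:' prefixes are hypothetical; they only keep the two
    feature families from colliding in the same Counter.
    """
    features = Counter()
    features.update("c3:" + gram for gram in char_ngrams(text, 3))
    features.update("w2:" + gram for gram in word_ngrams(text, 2))
    return features


# Hypothetical tweet text, used only to illustrate the representation.
sample = "son dakika haberi"
features = featurize(sample)
for name, count in sorted(features.items()):
    print(repr(name), count)
```

Counts like these would then be assembled into one vector per tweet or account and passed to a classifier; SMO, the algorithm the abstract reports as best, is a training procedure for support vector machines.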
Key words
text classification, Twitter, n-grams, support vector machine, social media networks