
Geographic Disaggregation of Textual Social Media Data: A Machine Learning-based Approach

Procedia Computer Science (2022)

Abstract
This research aims to identify the geographic origin of Arabic-speaking social media users by analyzing the textual data they produce and share. The paper presents an approach to infer users’ region (i.e., country) of origin by identifying the dialect they use in their written interactions. An Integrated Dataset for Arabic Dialect Detection (IADD) is proposed and used to train multiple classifiers, which identify users’ region and country of origin with accuracies of 0.89 and 0.93, respectively.
Key words
Automatic Dialect Identification, Arabic Language, Geographic Disaggregation, Machine Learning