Arabic hate speech detection system based on AraBERT

Palé Ollo Salomon,Zied Kechaou,Ali Wali

2022 IEEE 21st International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC)(2022)

引用 0|浏览1
暂无评分
摘要
Tunisia has entered a phase freedom of speech with access to social media since the Jasmine Revolution in 2011. Toxic contents such as abusive and hateful speeches have become omnipresent on Tunisian social media. Considering the side effects of these toxic contents on the psychology of users, it is necessary to detect them automatically. The dialect of Tunisian is underrepresented. As a consequence, there is not enough data set. In this paper, we present the data collection process with the aim of having a Tunisian reference dataset, to evaluate different models of hate speech and abuse detection. We also present our neural network model based on AraBERT. Our experimental results on our dataset shows that the AraBERT model performs better with an F1 score of 0.99.
更多
查看译文
关键词
Tunisian dialect,Hate speech,Social media,Arabert
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要