TTK: A toolkit for Tunisian linguistic analysis

COMPUTER SPEECH AND LANGUAGE(2024)

引用 0|浏览1
暂无评分
摘要
Over the last two decades, many efforts have been made to provide resources to support the Arabic Natural Language Processing (NLP). Some of these resources target specific NLP tasks such as word tokenization, parsing, or sentiment analysis, while others attempt to tackle numerous tasks at once. In this paper, we present ??TTK, a toolkit for Tunisian linguistic analysis. It consists of a collection of linguistic analysis tools for orthographic normalization, sentence boundaries detection, word tokenization, morphological analysis, parsing and named entity recognition. This paper focuses on the design and implementation of TTK tools
更多
查看译文
关键词
Arabic Dialect,Tunisian Arabic,Toolkit,Orthographic normalization,Sentence boundaries detection,Word tokenization,Morphological analysis,Parsing,Named entity recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要