谷歌浏览器插件
订阅小程序
在清言上使用

Neural Text Categorization with Transformers for Learning Portuguese as a Second Language

PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2021)(2021)

引用 0|浏览15
暂无评分
摘要
We report on the application of a neural network based approach to the problem of automatically categorizing texts according to their proficiency levels and suitability for learners of Portuguese as a second language. We resort to a particular deep learning architecture, namely Transformers, as we fine-tune GPT-2 and RoBERTa on data sets labeled with respect to the standard CEFR proficiency levels, that were provided by Camoes IC, the Portuguese official language institute. Despite the reduced size of the data sets available, we found that the resulting models overperform previous carefully crafted feature based counterparts in most evaluation scenarios, thus offering a new state-ofthe-art for this task in what concerns the Portuguese language.
更多
查看译文
关键词
Readability classification, Language proficiency, Neural networks, Deep learning, Portuguese
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要