PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese
arxiv(2024)
摘要
Leveraging research on the neural modelling of Portuguese, we contribute a
collection of datasets for an array of language processing tasks and a
corresponding collection of fine-tuned neural language models on these
downstream tasks. To align with mainstream benchmarks in the literature,
originally developed in English, and to kick start their Portuguese
counterparts, the datasets were machine-translated from English with a
state-of-the-art translation engine. The resulting PORTULAN ExtraGLUE benchmark
is a basis for research on Portuguese whose improvement can be pursued in
future work. Similarly, the respective fine-tuned neural language models,
developed with a low-rank adaptation approach, are made available as baselines
that can stimulate future work on the neural processing of Portuguese. All
datasets and models have been developed and are made available for two variants
of Portuguese: European and Brazilian.
更多查看译文
AI 理解论文
溯源树
样例
![](https://originalfileserver.aminer.cn/sys/aminer/pubs/mrt_preview.jpeg)
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要