SMART: A Stratified Machine Reading Test

Lecture Notes in Artificial Intelligence (2019)

Abstract
We present the Stratified MAchine Reading Test (SMART), a Chinese data set in which each question is assigned a "level" that reflects the type of reasoning needed to answer it. The data set consists of close to 40K question-answer pairs, and its stratified design lets machine reading researchers quickly focus on the areas that are most challenging for a machine comprehension system. We further establish a baseline for future research with BERT and present results showing that the levels we have designed correspond well to the difficulty BERT experiences in answering these questions, as reflected by lower accuracy at higher levels. We have also collected human answers to the questions in the test portion of the data set and show that humans and the machine face different challenges when answering them. This means that even though the machine is approaching human-level performance on this task, humans and the machine perform it through very different mechanisms.
Keywords
test, smart