Using artificially generated data to evaluate statistical machine translation

GEAF '09: Proceedings of the 2009 Workshop on Grammar Engineering Across Frameworks(2009)

引用 2|浏览10
暂无评分
摘要
Although Statistical Machine Translation (SMT) is now the dominant paradigm within Machine Translation, we argue that it is far from clear that it can outperform Rule-Based Machine Translation (RBMT) on small- to medium-vocabulary applications where high precision is more important than recall. A particularly important practical example is medical speech translation. We report the results of experiments where we configured the various grammars and rule-sets in an Open Source medium-vocabulary multi-lingual medical speech translation system to generate large aligned bilingual corpora for English → French and English → Japanese, which were then used to train SMT models based on the common combination of Giza++, Moses and SRILM. The resulting SMTs were unable fully to reproduce the performance of the RBMT, with performance topping out, even for English → French, with less than 70% of the SMT translations of previously unseen sentences agreeing with RBMT translations. When the outputs of the two systems differed, human judges reported the SMT result as frequently being worse than the RBMT result, and hardly ever better; moreover, the added robustness of the SMT only yielded a small improvement in recall, with a large penalty in precision.
更多
查看译文
关键词
SMT model,SMT result,SMT translation,RBMT result,RBMT translation,Machine Translation,Rule-Based Machine Translation,Statistical Machine Translation,Open Source medium-vocabulary multi-lingual,high precision,statistical machine translation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要