An Approach For Extracting Bilingual Terminology From Wikipedia

DASFAA'08: Proceedings of the 13th International Conference on Database Systems for Advanced Applications (2008)

Abstract
With the growing demand for bilingual dictionaries that cover domain-specific terminology, research on automatic dictionary extraction has become popular. However, the accuracy and coverage of dictionaries built from bilingual text corpora are often insufficient for domain-specific terms. We therefore present an approach to extracting bilingual dictionaries from the link structure of Wikipedia, a large-scale encyclopedia that contains a vast number of links between articles in different languages. Our methods not only analyze these interlanguage links but also extract further translation candidates from redirect pages and link text. In an experiment, we demonstrated the advantages of our methods over a traditional approach that extracts bilingual terminology from parallel corpora.
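The three link sources named in the abstract (interlanguage links, redirect pages, and link text) can be combined as in the following minimal sketch. It is an illustration under stated assumptions, not the authors' implementation: the function name `extract_candidates`, the in-memory data structures, and the toy English-German entries are all hypothetical, and the sketch simply unions the candidates without any of the scoring or filtering a real system would need.

```python
from collections import defaultdict


def extract_candidates(interlanguage_links, redirects, link_texts):
    """Collect translation candidates for each source-language article title.

    interlanguage_links: dict mapping a source-language title to the
        target-language title it is linked to.
    redirects: dict mapping a target-language redirect title to the
        target-language article it points to.
    link_texts: list of (anchor text, target-language article title) pairs.

    All three inputs are hypothetical, pre-parsed structures.
    """
    # Group redirect titles and anchor texts by the article they refer to.
    aliases = defaultdict(set)
    for redirect_title, article in redirects.items():
        aliases[article].add(redirect_title)
    for anchor, article in link_texts:
        aliases[article].add(anchor)

    candidates = {}
    for source_title, target_title in interlanguage_links.items():
        # Baseline: the interlanguage link itself.
        # Extension: redirects and anchor texts of the linked article.
        candidates[source_title] = {target_title} | aliases.get(target_title, set())
    return candidates


# Toy data invented for illustration only.
interlanguage_links = {"Car": "Automobil"}
redirects = {"Auto": "Automobil", "PKW": "Automobil"}
link_texts = [("Autos", "Automobil"), ("Auto", "Automobil")]

result = extract_candidates(interlanguage_links, redirects, link_texts)
print(sorted(result["Car"]))
```

Using sets keeps a term that appears as both a redirect title and an anchor text from being counted twice. Ranking the candidates, for example by how often an anchor text occurs, is left out because the abstract does not describe it.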
Keywords
bilingual dictionary, bilingual terminology, bilingual text corpus, domain-specific term, domain-specific terminology, interlanguage link, link structure, link text information, traditional approach, automatic dictionary extraction