Automatic Wordnet Development For Low-Resource Languages Using Cross-Lingual Wsd

Journal of Artificial Intelligence Research(2016)

引用 16|浏览106
暂无评分
摘要
Wordnet is an effective resource in natural language processing and information retrieval , especially for semantic processing and meaning related tasks. So far wordnet has been constructed in many languages. However, automatic development of wordnet for low-resource languages has not been studied well. In this paper an Expectation-Maximization algorithm is used to train high quality and large scale wordnet for resource-poor languages. The proposed method benefits from cross-lingual word sense disambiguation and develops a wordnet just using a bilingual dictionary and a monolingual corpus. The proposed method has been executed on Persian as a resource-poor language and the resulting wordnet has been evaluated through several experiments. Results show that the induced wordnet has a precision of 90% and recall of 35%.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要