Exploring Distinctive Features in Distant Supervision for Relation Extraction.

AIRS (2013)

Abstract
Distant supervision (DS) for relation extraction suffers from the noisy labeling problem. Most solutions model the noisy instances through multi-instance learning. However, even non-noisy instances may contain noisy features that harm the extraction model. In this paper, we address this problem by identifying distinctive features and assigning them more weight than the noisy ones. We use all the training data (both the labeled part that satisfies the DS assumption and the part that does not) and apply an unsupervised topic model to discover the distribution of features over latent relations. Finally, we compute the distinctiveness of each feature from the obtained feature-relation distribution and assign feature weights based on this distinctiveness to train the extractor. Experiments show that the approach significantly outperforms the baseline methods in both the held-out evaluation and the manual evaluation. © 2013 Springer-Verlag.
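
The weighting step described in the abstract can be illustrated with a small sketch. The abstract does not state how distinctiveness is computed, so the measure below (one minus the normalized entropy of a feature's distribution over latent relations) is an assumption chosen for illustration, not the paper's formula. The function names, feature names, and toy probabilities are likewise hypothetical.

```python
import math

def distinctiveness(relation_probs):
    """Score a feature by how concentrated its relation distribution is.

    ASSUMPTION: uses 1 - normalized entropy; the paper's actual measure
    is not given in the abstract.
    """
    k = len(relation_probs)
    if k < 2:
        return 1.0  # a single relation leaves nothing to confuse
    entropy = -sum(p * math.log(p) for p in relation_probs if p > 0)
    return 1.0 - entropy / math.log(k)

def feature_weights(feature_relation_dist):
    """Map each feature to a weight derived from its distinctiveness."""
    return {feat: distinctiveness(probs)
            for feat, probs in feature_relation_dist.items()}

# Toy feature-relation distributions (invented values), standing in for
# what a topic model over features might produce for four latent relations.
dist = {
    "dep_path:born_in": [0.97, 0.01, 0.01, 0.01],   # concentrated
    "word_between:the": [0.25, 0.25, 0.25, 0.25],   # spread evenly
}
weights = feature_weights(dist)
```

Under this assumed measure, a feature concentrated on one latent relation receives a weight near 1, while a feature spread evenly across relations receives a weight near 0, which is the behavior the abstract describes for distinctive versus noisy features.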
Keywords
distant supervision, distinctive features, relation extraction