Semantic role labeling for open information extraction

FAM-LbR '10: Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading (2010)

Abstract
Open Information Extraction is a recent paradigm for machine reading from arbitrary text. In contrast to existing techniques, which have used only shallow syntactic features, we investigate the use of semantic features (semantic roles) for the task of Open IE. We compare TextRunner (Banko et al., 2007), a state-of-the-art open extractor, with our novel extractor SRL-IE, which is based on UIUC's SRL system (Punyakanok et al., 2008). We find that SRL-IE is robust to noisy, heterogeneous Web data and outperforms TextRunner on extraction quality. On the other hand, TextRunner runs more than two orders of magnitude faster and achieves good precision on high-locality and high-redundancy extractions. These observations enable the construction of hybrid extractors that output higher-quality results than TextRunner, and results of quality similar to SRL-IE's, in much less time.
Keywords
extraction quality, novel extractor SRL-IE, output higher quality result, similar quality, Open IE, Open Information Extraction, art open extractor, high locality, high redundancy extraction, hybrid extractor, semantic role