Probabilistic Parsing of Unrestricted English Text, With a I-Iighly-Detailed Grammar.

VLC(1997)

引用 25|浏览13
暂无评分
摘要
Summary A grammar-based probabilistic parser is described, and experimental results are pre- sented for the parser as trained and tested on a 676,000-word, highly varied treebank of unrestricted English text. Probabilistic decision trees are utilized as a means of prediction, and a grammar with about 3000 semantic-and-syntactic tags, and 1100 non-terminal node labels suppl/es detailed lingzdstic information. Further such data is supplied for prediction purposes by thousands of questions about "raw" words, expres~ sions, and the sentence as a whole. The rich/n.formation base used for parse prediction allows the system to parse in a domain-general, totally--open-vocabulary setting, and to output highly-detailed semantic as well as syntactic information for sentences pro- ccessed. Finally, a statistical procedure is described for converting less-detailed into more--detailed treebank, for use in increasing parser accuracy via much larger training treeb~.nlc~.
更多
查看译文
关键词
decision tree
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要