Word Segmentation Method For Oracle Bone Inscriptions Based On Dictionary And Syntactic Rules

2012 7TH INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING (SOSE)(2012)

引用 2|浏览5
暂无评分
摘要
According to Oracle Bone Inscriptions(OBI)'s own characteristics, proposed a combination of thesaurus-based word frequency, part of speech, syntactic rules segmentation algorithm. Firstly, the method obtained by the initial OBI dictionary segmentation result, and then using OBI's grammar rulse and word entry filter rules to get segmentation result; Finally, using the unknown words identification rules to check the result, the words which meet the unknown word's condition threshold will be added into the dictionary. The segmentation method can largely remove ambiguities and improve the identification of unknown words probability. Experimental results show that the method is consistent with the grammatical features of OBI, and it can obtain high accuracy and recall.
更多
查看译文
关键词
Oracle Bone Inscriptions,word segmentation,syntactic rules,disambiguation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要