A multilingual clause boundary detection approach for Assamese and Bishnupriya Manipuri

Bhubneswar Das, Smriti Kumar Sinha

2023 4th International Conference on Computing and Communication Systems (I3CS) (2023)

Abstract
Clause boundary detection is an important step in various natural language processing tasks, such as dependency parsing and machine translation. A complex sentence in any language may contain more than one clause. The clause boundaries must be identified in order to capture dependency relations and to identify the arguments of the verbal components. We present the task of identifying and classifying clauses in Assamese and Bishnupriya Manipuri text. To the best of our knowledge, little work has been done on clause boundary identification for these two languages, which makes the task all the more important. We have built a rule-based system that uses linguistic cues such as coordinating and subordinating conjuncts. We additionally incorporate the POS tags of neighbouring words as features into the existing conjunction-based approach; the resulting method can also be applied to other languages that belong to the same language family and share certain similarities. Experimental results show that our approach achieves satisfactory results.
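
The abstract gives no rule set, so the following is only a minimal sketch of what a conjunction-based splitter with a POS-context check might look like. The tag names (`CC`, `CS`, `VB`), the function names, and the "verb on both sides" rule are assumptions made for illustration and are not taken from the paper; English tokens stand in for Assamese or Bishnupriya Manipuri text.

```python
from typing import List, Tuple

Token = Tuple[str, str]  # (word, POS tag)

# Hypothetical tag names; the paper's actual tagset is not given in the abstract.
COORD = "CC"        # coordinating conjunct
SUBORD = "CS"       # subordinating conjunct
VERB_TAGS = {"VB"}  # tags treated as verbal


def has_verb(tokens: List[Token]) -> bool:
    """Return True if any token carries a verbal POS tag."""
    return any(tag in VERB_TAGS for _, tag in tokens)


def split_clauses(sentence: List[Token]) -> List[List[Token]]:
    """Split a POS-tagged sentence at conjuncts that separate two verbal spans.

    Assumed rule: a conjunct opens a new clause only if a verb occurs both
    in the clause collected so far and in the remainder of the sentence.
    This keeps phrase-level coordination ("Ram and Sita") inside one clause.
    """
    clauses: List[List[Token]] = []
    current: List[Token] = []
    for i, (word, tag) in enumerate(sentence):
        is_conjunct = tag in (COORD, SUBORD)
        if is_conjunct and has_verb(current) and has_verb(sentence[i + 1:]):
            clauses.append(current)
            current = [(word, tag)]  # the conjunct starts the next clause
        else:
            current.append((word, tag))
    if current:
        clauses.append(current)
    return clauses


def classify_clause(clause: List[Token]) -> str:
    """Label a clause by the tag of its first token (assumed labelling scheme)."""
    first_tag = clause[0][1]
    if first_tag == SUBORD:
        return "subordinate"
    if first_tag == COORD:
        return "coordinate"
    return "main"
```

Checking the POS context around each conjunct, rather than splitting at every conjunct, is one plausible reading of "POS tags of neighbouring words as features"; the real system may use different or additional rules.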
Keywords
Clause, Conjunction, coordinating conjunct, complex sentence