An Empirical Analysis of PoS Tagging for Kannada Machine Translation

Jamuna,H R Mamatha

2023 International Conference on Applied Intelligence and Sustainable Computing (ICAISC)(2023)

引用 0|浏览3
暂无评分
摘要
Parts of Speech (POS) tagging process has emerged as one of the crucial and very basic preprocessing technique for any natural language processing tasks. In Kannada, a dominant language in southern India, morphologically rich and low resourced at the same time, PoS tagging process was difficult to achieve in the beginning. Later, studies show that Conditional Random Fields, Hidden Morkov Model and deep learning techniques have produced good accuracy. This paper investigates all the three above mentioned models and argues that deep learning model, which uses bidirectional Long Short Term Memory as a RNN unit, produces the highest accuracy of 93% in contrast to CRF and HMM model with a precision accuracy of 65% and 42% respectively. Also, the paper specifies how important a PoS tagging process is in the task of Machine Translation, which is booming in the world of computational linguistics.
更多
查看译文
关键词
Conditional Random Fields (CRF),Word2Vec,Hidden Morkov Model (HMM),bidirectional Long Short Term Memory (BiLSTM)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要