Named Entity Recognition For Amharic Using Deep Learning

2017 IST-Africa Week Conference (IST-Africa) (2017)

Abstract
The paper describes a named entity recognition system for Amharic, an under-resourced language. The system uses a recurrent neural network, specifically a bi-directional long short-term memory model, to identify and classify tokens into six predefined classes: Person, Location, Organization, Time, Title, and Other (non-named-entity tokens). Word vectors based on semantic information are built for all tokens using an unsupervised learning algorithm, word2vec. The word vectors are merged with a set of specifically developed language-independent features and fed together to the neural network model to predict the class of each word. When evaluated by 10-fold cross-validation, the resulting Amharic named entity recogniser achieved an average precision of 77.2% but a lower recall of 63.4%, for an F1-score of 69.7%.
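As a quick check on how the three reported figures relate, the F1-score is the harmonic mean of precision and recall. The sketch below is illustrative only and is not taken from the paper; the function name `f1_score` and the variable names are hypothetical. Applied to the averaged precision and recall it gives roughly 69.6%, slightly below the reported 69.7%. One plausible explanation, stated here as an assumption, is that the paper averages per-fold F1-scores rather than computing F1 from the averaged precision and recall.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# Average precision and recall as reported in the abstract.
avg_precision = 0.772
avg_recall = 0.634

# F1 computed from the averaged values (not necessarily how the paper computed it).
f1_from_averages = f1_score(avg_precision, avg_recall)
print(f"{f1_from_averages:.3f}")  # 0.696
```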
Keywords
Named Entity Recognition, Amharic, Under-resourced languages, Recurrent neural network, Long short term memory