Improve on Entity Recognition Method Based on BiLSTM-CRF Model for the Nuclear Technology Knowledge Graph

Rui Shi,Zhenchuan Wang, Yang Liu, Yunliang Lan,Wei Zhao, Yitang Liu

2022 5th International Conference on Pattern Recognition and Artificial Intelligence (PRAI)(2022)

Cited 1|Views6
No score
Abstract
The accuracy of entity recognition is particularly important for knowledge graph construction. The traditional Named Entity recognition (NER) model mainly includes HMM, CRF, BiLSTM, BiLSTM-CRF, etc. It is difficult to solve the problem of word meaning confusion resulting from the wrong separation at the end of the entity when using the four models for the labeled nuclear technology knowledge data sets. In order to improve the entity recognition effect and address the problem of polysemy in nuclear technology knowledge data set, an improved nuclear technology entity recognition method based on the BERT-BiLSTM-CRF combination model was proposed by comparative experiments. According to the results, it can conclude that the application of the BERT model instead of the Word2Vec algorithm for word vector training is helpful to the model recognition, and the exclusive dictionary word segmentation and part of speech classification of nuclear technology texts contribute to improving the quality of labeled data. The experiment verified that the usage of BERT pre-training model can solve the problem of polysemy in NER to some extent. Meanwhile, it validated the precision in nuclear technology knowledge entity recognition for collected data, which could be used in pre-procedure in nuclear technology knowledge graph construction.
More
Translated text
Key words
nuclear technology knowledge graph,named entity recognition,BERT-BiLSTM-CRF,Vector
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined