Transformer based named entity recognition for place name extraction from unstructured text

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE (2023)

Abstract
Place names embedded in online natural language text are a useful source of geographic information. Despite this, many methods for extracting place names from text rely on pre-trained models that were not explicitly designed for the task. We build five custom Named Entity Recognition (NER) models and evaluate them against three popular pre-built models for place name extraction. The models are evaluated on a set of manually annotated Wikipedia articles using the F1 score. Our best-performing model achieves an F1 score of 0.939, compared with 0.730 for the best-performing pre-built model. We then use this model to extract all place names from Wikipedia articles in Great Britain, demonstrating that it more accurately captures unknown place names from volunteered sources of online geographic information.
Keywords
Named entity recognition, volunteered geographic information, natural language processing, place name extraction
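
The abstract compares models by F1 score on manually annotated text. As a rough illustration of how such a score can be computed for place name spans, here is a minimal Python sketch. It assumes exact-match, entity-level scoring over character spans; the abstract does not state the matching rule the authors used, and the `Span` type, the `f1_score` function, and the sample annotations are all hypothetical.

```python
from typing import Set, Tuple

# Hypothetical representation: (start_char, end_char) of one place name mention.
Span = Tuple[int, int]


def f1_score(gold: Set[Span], predicted: Set[Span]) -> float:
    """Entity-level F1 under exact span matching (an assumed matching rule)."""
    true_positives = len(gold & predicted)
    if true_positives == 0:
        # Covers empty inputs and the case of no overlap at all.
        return 0.0
    precision = true_positives / len(predicted)  # share of predictions that are correct
    recall = true_positives / len(gold)          # share of gold spans that were found
    return 2 * precision * recall / (precision + recall)


# Made-up annotations for a single document, for illustration only.
gold = {(0, 6), (20, 30), (45, 52)}
predicted = {(0, 6), (20, 30), (60, 65)}
score = f1_score(gold, predicted)
```

In this made-up case two of three predicted spans match two of three gold spans, so precision and recall are both 2/3. A relaxed rule that credits partially overlapping spans would generally give different numbers.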