A dataset for Sentiment analysis of Entities in News headlines (SEN)

KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021)(2021)

引用 2|浏览5
暂无评分
摘要
On-line news portals play a very important role in the information society. Fair media should present reliable and objective information. In practice there is an observable positive or negative bias concerning named entities (e.g. politicians) mentioned in the on-line news headlines. In this paper we present SEN - a novel publicly available human-labelled dataset for training and testing machine learning algorithms for the problem. It consists of 3819 human-labelled political news headlines coming from several major on-line media outlets in English and Polish. We also describe the process of preparing the dataset and present its analysis, including entity and annotator bias analysis, and some insights into possible challenges of the task of entity-level analysis of the news. (C) 2021 The Authors. Published by Elsevier B.V.
更多
查看译文
关键词
benchmark dataset, sentiment analysis, on-line news, media bias, supervised machine learning, annotator bias
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要