Towards Interpretable Summary Evaluation via Allocation of Contextual Embeddings to Reference Text Topics

Ben Schaper, Christopher Lohse, Marcell Streile, Andrea Giovannini, Richard Osuala

arXiv (2022)

Abstract
Despite extensive recent advances in summary generation models, evaluation of auto-generated summaries still widely relies on single-score systems insufficient for transparent assessment and in-depth qualitative analysis. Towards bridging this gap, we propose the multifaceted interpretable summary evaluation method (MISEM), which is based on allocation of a summary's contextual token embeddings to semantic topics identified in the reference text. We further contribute an interpretability toolbox for automated summary evaluation and interactive visual analysis of summary scoring, topic identification, and token-topic allocation. MISEM achieves a promising .404 Pearson correlation with human judgment on the TAC'08 dataset.
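The token-topic allocation idea described above can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how topics are identified or how allocations are scored, so the hard nearest-topic assignment by cosine similarity, the per-topic mean, the unweighted average, and all names (`allocate_tokens`, `topic_scores`, the toy 2-d vectors) are assumptions made for illustration only.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def allocate_tokens(token_embeddings, topic_centroids):
    """Assign each summary token to its most similar topic (assumed hard allocation)."""
    allocation = []
    for emb in token_embeddings:
        sims = [cosine(emb, c) for c in topic_centroids]
        best = max(range(len(sims)), key=sims.__getitem__)
        allocation.append((best, sims[best]))
    return allocation


def topic_scores(allocation, n_topics):
    """Mean similarity of the tokens allocated to each topic; 0.0 if no token covers it."""
    totals = [0.0] * n_topics
    counts = [0] * n_topics
    for topic, sim in allocation:
        totals[topic] += sim
        counts[topic] += 1
    return [t / c if c else 0.0 for t, c in zip(totals, counts)]


# Toy stand-ins: real inputs would be contextual embeddings from a language model.
topics = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]   # hypothetical reference-text topics
tokens = [[0.9, 0.1], [0.2, 0.8], [1.0, 0.0]]    # hypothetical summary tokens

allocation = allocate_tokens(tokens, topics)
scores = topic_scores(allocation, len(topics))
summary_score = sum(scores) / len(scores)         # assumed unweighted average
```

Keeping the per-topic scores alongside the single aggregate is what makes such a scheme inspectable: in this toy case the third topic receives no tokens, so its score of zero shows which part of the reference the summary fails to cover.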
Keywords
interpretable summary evaluation, reference contextual