M-Arg - Multimodal Argument Mining Dataset for Political Debates with Audio and Transcripts.

Rafael Mestre, Razvan Milicin,Stuart Middleton,Matt Ryan,Jiatong Zhu,Timothy J. Norman

ArgMining@EMNLP（2021）

引用 0|浏览3

暂无评分

摘要

Argumentation mining aims at extracting, analysing and modelling people’s arguments, but large, high-quality annotated datasets are limited, and no multimodal datasets exist for this task. In this paper, we present M-Arg, a multimodal argument mining dataset with a corpus of US 2020 presidential debates, annotated through crowd-sourced annotations. This dataset allows models to be trained to extract arguments from natural dialogue such as debates using information like the intonation and rhythm of the speaker. Our dataset contains 7 hours of annotated US presidential debates, 6527 utterances and 4104 relation labels, and we report results from different baseline models, namely a text-only model, an audio-only model and multimodal models that extract features from both text and audio. With accuracy reaching 0.86 in multimodal models, we find that audio features provide added value with respect to text-only models.

查看译文

关键词

multimodal argument mining dataset,political debates,transcripts,m-arg

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要