Towards Integration of Embodiment Features for Prosodic Prominence Prediction from Text.

International Conference on Multimodal Interaction (ICMI), 2022

Abstract
Prosodic prominence prediction is an important task in speech processing and an essential part of modern text-to-speech systems. Previous work has focused broadly on acoustic and linguistic features (such as syntactic and semantic features) for predicting prosodic prominence. However, human models of prosody are known to be highly multimodal and grounded in denotations of physical entities and embodied experience. In this paper, we present a first study that integrates multimodal sensorimotor associations, drawn from the Lancaster Sensorimotor Norms, into prosodic prominence prediction. Our results highlight the importance of sensorimotor knowledge, especially for models in low-data regimes, where it improves performance by a significant margin.
Keywords
prosodic prominence prediction, embodiment features, text