Affective Processes: stochastic modelling of temporal context for emotion and facial expression recognition

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021(2021)

引用 31|浏览365
暂无评分
摘要
Temporal context is key to the recognition of expressions of emotion. Existing methods, that rely on recurrent or self-attention models to enforce temporal consistency, work on the feature level, ignoring the task-specific temporal dependencies, and fail to model context uncertainty. To alleviate these issues, we build upon the framework of Neural Processes to propose a method for apparent emotion recognition with three key novel components: (a) probabilistic contextual representation with a global latent variable model; (b) temporal context modelling using task-specific predictions in addition to features; and (c) smart temporal context selection. We validate our approach on four databases, two for Valence and Arousal estimation (SEWA and AffWild2), and two for Action Unit intensity estimation (DISFA and BP4D). Results show a consistent improvement over a series of strong baselines as well as over state-of-the-art methods.
更多
查看译文
关键词
global latent variable model,task-specific predictions,smart temporal context selection,affective Processes,stochastic modelling,facial expression recognition,self-attention models,temporal consistency,task-specific temporal dependencies,context uncertainty,apparent emotion recognition,action unit intensity estimation,DISFA,BP4D,probabilistic contextual representation,temporal context modelling,valence and arousal estimation,AffWild2,SEWA,neural processes
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要