Multimodal Fusion-based Swin Transformer for Facial Recognition Micro-Expression Recognition

Xinhua Zhao,Yongjia Lv, Zheng Huang

PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022)(2022)

引用 3|浏览2
暂无评分
摘要
Micro-expression recognition is the domain of vigorous computational vision research, which up against significant challenges stems from micro-expressions being spontaneous, brief and faint facial muscle movements. The paper presents a very novel method of Multimodal fusion micro-expression recognition using a visual transformer, which is not commonly used for micro-expression recognition. As compared to convolutional neural networks, transformers are widely thought to require more data. Then, we choose similar expression datasets to pre-training the model, while increasing the number of datasets.The results of the validation and evaluation of the model conducted with the CASME II, MMEW and SMIC datasets yielded state-of-the-art performance in terms of average accuracy of 81.50%, 82.97%, and 79.99%, respectively.When using Score-CAM to obtain the facial expression activation heat map, it is obvious that our model matches well with the expression action units. The proposed model obtains more promising recognition results than many other recognition methods.
更多
查看译文
关键词
Micro-expression, Apex frame, Difference, Vision Transformer
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要