Audio-Driven High Definetion and Lip-Synchronized Talking Face Generation Based on Face Reenactment

Xianyu Wang,Yuhan Zhang,Weihua He,Yaoyuan Wang,Minglei Li, Yuchen Wang,Jingyi Zhang, Shunbo Zhou,Ziyang Zhang

ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2023)

引用 0|浏览4
暂无评分
摘要
Generating audio-driven photo-realistic talking face has received intensive attention due to its ability to bring more new human-computer interaction experiences. However, previous works struggled to balance high definition, lip synchronization, and low customization costs, which would degrade the user experience. In this paper, a novel audio-driven talking face generation method was proposed, which subtly converts the problem of improving video definition into the problem of face reenactment to produce both lip-synchronized and high- definition face video. The framework is decoupled, meaning that the same trained model can be used on arbitrary characters and audio without further customizing training for specific people, thus significantly reducing costs. Experiment results show that our proposed method achieves the high video definition, and comparable lip synchronization performance with the existing state-of-the-art methods.
更多
查看译文
关键词
Talking face generation,Lip sync,High definition,Audio driven animation,Face reenactment
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要