Text driven virtual speakers.

Vladimir Obradovic, Ilija Rajak, Milan Secujski, Vlado Delic

European Signal Processing Conference (EUSIPCO) (2022)

Abstract
Online courses have seen exponential growth during the COVID-19 pandemic, and video lectures are also important for lifelong learning. However, lecturers face a number of challenges in creating video lectures, related to both speech recording (microphone and noise; diction, articulation and intonation) and video recording (camera and lighting; consistency in appearance). Recorded content is particularly difficult to modify and update. The paper presents a solution to these problems that uses artificial intelligence to create virtual speakers, combining TTS synthesis with a Wav2Lip GAN trained on a custom data set. A pilot project, in which dozens of teachers evaluated and tested the developed system, is presented in detail. The use of TTS overcomes the problems of achieving speaker consistency by providing high-quality speech in different languages, while animated virtual speakers improve the attention and motivation of students.
Keywords
virtual speakers, text