Chrome Extension
WeChat Mini Program
Use on ChatGLM

Exploring Transfer Learning For Low Resource Emotional Tts

INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1(2020)

Cited 52|Views16
No score
Abstract
During the last few years, spoken language technologies have known a big improvement thanks to Deep Learning. However Deep Learning-based algorithms require amounts of data that are often difficult and costly to gather. Particularly, modeling the variability in speech of different speakers, different styles or different emotions with few data remains challenging. In this paper, we investigate how to leverage fine-tuning on a pre-trained Deep Learning-based TTS model to synthesize speech with a small dataset of another speaker. Then we investigate the possibility to adapt this model to have emotional TTS by fine-tuning the neutral TTS model with a small emotional dataset.
More
Translated text
Key words
Speech synthesis, Emotion, Deep learning, Transfer learning, Fine-tuning
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined