AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (2024)
Key words
Audio-Visual Speech Recognition, Speech Enhancement, Automatic Speech Recognition, End-to-End Speech Recognition, Acoustic Modeling