MAMGAN: Multiscale attention metric GAN for monaural speech enhancement in the time domain

Huimin Guo,Haifang Jian,Yequan Wang,Hongchang Wang, Xiaofan Zhao,Wenqi Zhu, Qinghua Cheng

Applied Acoustics(2023)

Cited 1|Views1
No score
Abstract
•This paper proposes a multiscale attention metric GAN for monaural SE in the time domain, which can effectively reduce noise and improve the quality and intelligibility of speech.•This paper proposes a multiscale attention module to act as the bottleneck layer to improve the feature extraction ability of long sequence features.•This paper tests the proposed model on two benchmark datasets. The results show that the proposed method outperforms the current state-of-the-art (SOTA) time-domain SE methods.
More
Translated text
Key words
Speech enhancement,Time domain,Multiscale attention,Attention metric discriminator
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined