PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement

Conference of the International Speech Communication Association (INTERSPEECH) (2022)

Abstract
PercepNet, a recent extension of RNNoise, an efficient, high-quality, real-time full-band speech enhancement technique, has shown promising performance in various public deep noise suppression tasks. This paper proposes a new approach, named PercepNet+, that further extends PercepNet with four significant improvements. First, we introduce a phase-aware structure that incorporates phase information into PercepNet by adding complex features and complex sub-band gains as the deep network input and output, respectively. Second, a signal-to-noise ratio (SNR) estimator and an SNR-switched post-processing step are specially designed to alleviate the over-attenuation (OA) that appears in high-SNR conditions with the original PercepNet. Third, the GRU layer is replaced by TF-GRU to model both temporal and frequency dependencies. Finally, we propose to integrate the losses for complex sub-band gain, SNR, and pitch filtering strength with an OA loss in a multi-objective learning manner to further improve speech enhancement performance. Experimental results show that the proposed PercepNet+ significantly outperforms the original PercepNet in terms of both PESQ and STOI, without a large increase in model size.
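The multi-objective idea in the abstract can be illustrated with a minimal sketch. The abstract names four loss terms but does not give their formulas or weights, so everything below is an assumption made for illustration: the squared-error form of each term, the one-sided OA penalty, the weighted-sum combination, and all function and parameter names are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def mse(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean squared error between two equal-length real sequences."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def complex_gain_mse(pred: Sequence[complex], target: Sequence[complex]) -> float:
    """Mean squared magnitude of the error between complex sub-band gains.

    Assumption: a squared-error form; the paper may define this term differently.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    return sum(abs(p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def over_attenuation_penalty(pred_gain: Sequence[float],
                             target_gain: Sequence[float]) -> float:
    """Hypothetical OA term: penalize only gains predicted below the target.

    Bands where the predicted gain is at or above the target contribute zero.
    """
    if len(pred_gain) != len(target_gain):
        raise ValueError("pred_gain and target_gain must have the same length")
    total = sum(max(t - p, 0.0) ** 2 for p, t in zip(pred_gain, target_gain))
    return total / len(pred_gain)


def multi_objective_loss(gain_loss: float, snr_loss: float, pitch_loss: float,
                         oa_loss: float,
                         weights: Sequence[float] = (1.0, 1.0, 1.0, 1.0)) -> float:
    """Weighted sum of the four terms; the weights are illustrative placeholders."""
    terms = (gain_loss, snr_loss, pitch_loss, oa_loss)
    if len(weights) != len(terms):
        raise ValueError("expected one weight per loss term")
    return sum(w * t for w, t in zip(weights, terms))
```

In this sketch each term would be computed per frame from the network outputs (complex sub-band gains, the SNR estimate, and the pitch filtering strength) and then combined by `multi_objective_loss`. The one-sided penalty is just one plausible way to discourage over-attenuation, since it ignores bands where the predicted gain is already at or above the target.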
Keywords
speech enhancement, phase-aware structure, SNR-switched post-processing, multi-objective learning