Binaural Codebook-Based Speech Enhancement With Atomic Speech Presence Probability

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)(2019)

引用 10|浏览19
暂无评分
摘要
In this work, we present a universal codebook-based speech enhancement framework that relies on a single codebook to encode both speech and noise components. The atomic speech presence probability ASPP is defined as the probability that a given codebook atom encodes speech at a given point in time. We develop ASPP estimators based on binaural cues including the interaural phase and level difference IPD and ILD, the interaural coherence magnitude ICM, as well as a combined version leveraging the full interaural transfer function ITF. We evaluate the performance of the resulting ASPP-based speech enhancement algorithms on binaural mixtures of reverberant speech and real-world noise. The proposed approach improves both objective speech quality and intelligibility over a wide range of input SNR, as measured with PESQ and binaural STOI metrics, outperforming two binaural speech enhancement benchmark methods. We show that the proposed ITF-based ASPP approach achieves a good balance of the trade-off between binaural noise reduction and binaural cue preservation.
更多
查看译文
关键词
Speech enhancement,Speech coding,Noise reduction,Noise measurement,Estimation,Indexes
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要