Modified Parametric Multichannel Wiener Filter for Low-latency Enhancement of Speech Mixtures with Unknown Number of Speakers

2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC(2023)

引用 0|浏览31
暂无评分
摘要
This paper introduces a novel low-latency online beamforming (BF) algorithm called the Modified Parametric Multichannel Wiener Filter (Mod-PMWF), which enhances speech mixtures with unknown and varying number of speakers. Although such conventional BFs as Linearly Constrained Minimum Variance BFs (LCMV BFs) can enhance a speech mixture, they typically require such speech mixture attributes as the number of speakers and the acoustic transfer functions (ATFs) from the speakers to the microphones. When mixture attributes are unavailable, estimating them by low-latency processing is challenging, hindering the application of BFs to the problem. In this paper, we overcome this problem by modifying a conventional Parametric Multichannel Wiener Filter (PMWF). The proposed Mod-PMWF can adaptively form a directivity pattern that enhances all the speakers in the mixture without explicitly estimating these attributes. Our experiments show the proposed BF's effectiveness in interference reduction ratios and subjective listening tests.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要