Generalized Wiener filtering with fractional power spectrograms
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2015)
摘要
In the recent years, many studies have focused on the single-sensor separation of independent waveforms using so-called soft-masking strategies, where the short term Fourier transform of the mixture is multiplied element-wise by a ratio of spectrogram models. When the signals are wide-sense stationary, this strategy is theoretically justified as an optimal Wiener filtering: the power spectrograms of the sources are supposed to add up to yield the power spectrogram of the mixture. However, experience shows that using fractional spectrograms instead, such as the amplitude, yields good performance in practice, because they experimentally better fit the additivity assumption. To the best of our knowledge, no probabilistic interpretation of this filtering procedure was available to date. In this paper, we show that assuming the additivity of fractional spectrograms for the purpose of building soft-masks can be understood as separating locally stationary α-stable harmonizable processes, α-harmonizable in short, thus justifying the procedure theoretically.
更多查看译文
关键词
audio source separation,probability theory,harmonizable processes,α-stable random variables,soft-masks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络