Extraction of Fundamental Frequency From Degraded Speech Using Temporal Envelopes at High SNR Frequencies.

IEEE/ACM Trans. Audio, Speech & Language Processing(2017)

引用 19|浏览17
暂无评分
摘要
In this paper we propose a method for extracting the fundamental frequency fo from degraded speech signals using single frequency filtering SFF approach. The SFF of frequency-shifted speech signal gives high signal-to-noise ratio SNR segments at some frequencies and hence the SFF approach can be exploited for fo extraction using autocorrelation function of those segments. Since the fo is computed from the envelope of a single frequency component of the signal, the vocal tract resonances do not affect the fo extraction. The use of the high SNR frequency component in a given segment helps in overcoming the effects of degradations in the speech signal, without explicitly estimating the characteristics of noise. The proposed method of fo extraction is shown to give better performance for several types of real and simulated degradations, in comparison with some of the methods reported recently in the literature.
更多
查看译文
关键词
Speech,Degradation,Resonant frequency,Estimation,Time-frequency analysis,Harmonic analysis,Signal to noise ratio
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要