Perceptual Equivalence Of The Liljencrants-Fant And Linear-Filter Glottal Flow Models

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA(2021)

引用 1|浏览4
暂无评分
摘要
Speech glottal flow has been predominantly described in the time-domain in past decades, the Liljencrants-Fant (LF) model being the most widely used in speech analysis and synthesis, despite its computational complexity. The causal/anti-causal linear model (LFCALM) was later introduced as a digital filter implementation of LF, a mixed-phase spectral model including both anti-causal and causal filters to model the vocal-fold open and closed phases, respectively. To further simplify computation, a causal linear model (LFLM) describes the glottal flow with a fully causal set of filters. After expressing these three models under a single analytic formulation, we assessed here their perceptual consistency, when driven by a single parameter R-d related to voice quality. All possible paired combinations of signals generated using six R-d levels for each model were presented to subjects who were asked whether the two signals in each pair differed. Model pairs LFLM-LFCALM were judged similar when sharing the same R-d value, and LF was considered the same as LFLM and LFCALM given a consistent shift in R-d. Overall, the similarity between these models encourages the use of the simpler and more computationally efficient models LFCALM and LFLM in speech synthesis applications.
更多
查看译文
关键词
flow,perceptual equivalence,liljencrants–fant,models,linear-filter
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要