谷歌Chrome浏览器插件
订阅小程序
在清言上使用

Intelligent Sparse Representations for Speech ∗

semanticscholar(2015)

引用 0|浏览2
暂无评分
摘要
We design a dictionary in which speech signals have a sparse representation. We utilize the property that speech is comprised of a fixed number of phonemes. The dictionary is a concatenation of the principal components of all these phonemes, and hence information about each phoneme is present in a block. Since any speech signal is a concatenation of phonemes, it can be represented as a linear combination of the columns of this dictionary. In particular, if we consider a small window of speech (containing no more than two phonemes), such a signal would ideally have a block sparse representation in the dictionary. The representation is obtained by solving a variation of the LASSO or basis pursuit denoising (BPDN) problem. We show that the representation is sparse enough to achieve compression. Finally, our intuition is that such a representation could also implicitly perform denoising.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要