Spoken language estimating device, method, and program

user-5e9d449e4c775e765d44d7c9(2011)

Abstract
PROBLEM TO BE SOLVED: To estimate the language spoken in an input speech signal without converting the signal into text-level linguistic expressions and without requiring prior knowledge. SOLUTION: In a spoken language estimating device, a phoneme expression calculation section 13 decomposes a mel spectrum extracted from a training speech signal using NMF (Non-negative Matrix Factorization) to obtain a phoneme expression H and a compound ratio U for each language, and the phoneme expression H is stored in a phoneme expression storage section 14 for each language. When a speech signal to be estimated is input, a feature information extraction section 12 extracts its mel spectrum, and a phoneme compound ratio calculation section 15 calculates, for each language, a compound ratio U from the extracted mel spectrum and the phoneme expression H stored in the phoneme expression storage section 14. A language similarity estimation section 16 then calculates, for each language, the product of the calculated compound ratio U and the stored phoneme expression H, and estimates the language of the input speech signal from the similarity between that product and the mel spectrum extracted from the same signal.
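The two phases described in the abstract can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the patented implementation: the abstract does not specify the matrix orientation, the NMF update rule, or the similarity measure, so the sketch assumes a mel spectrum V of shape (mel bins x frames) factorized as V ≈ H U, Lee-Seung multiplicative updates for the Euclidean cost, and the Frobenius norm of the reconstruction residual as the (inverse) similarity. The function names `learn_basis`, `fit_ratio`, and `estimate_language` are hypothetical.

```python
import numpy as np

EPS = 1e-9  # guards against division by zero in the multiplicative updates


def learn_basis(V, k, n_iter=200, seed=0):
    """Training phase (assumed form): factorize V (mel bins x frames) as H @ U.

    H plays the role of the per-language phoneme expression,
    U the role of the compound ratio.
    """
    rng = np.random.default_rng(seed)
    n_mel, n_frames = V.shape
    H = rng.random((n_mel, k)) + EPS
    U = rng.random((k, n_frames)) + EPS
    for _ in range(n_iter):
        # Multiplicative updates keep both factors non-negative.
        U *= (H.T @ V) / (H.T @ H @ U + EPS)
        H *= (V @ U.T) / (H @ U @ U.T + EPS)
    return H, U


def fit_ratio(V, H, n_iter=200, seed=0):
    """Estimation phase: keep the stored H fixed and solve only for U."""
    rng = np.random.default_rng(seed)
    U = rng.random((H.shape[1], V.shape[1])) + EPS
    for _ in range(n_iter):
        U *= (H.T @ V) / (H.T @ H @ U + EPS)
    return U


def estimate_language(V, bases):
    """Return the language whose stored H reconstructs V with the smallest residual.

    bases: dict mapping a language label to its stored H.
    The Frobenius residual is an assumed similarity measure.
    """
    errors = {}
    for lang, H in bases.items():
        U = fit_ratio(V, H)
        errors[lang] = float(np.linalg.norm(V - H @ U))
    best = min(errors, key=errors.get)
    return best, errors
```

In use, one would call `learn_basis` once per language on its training mel spectrum, keep only the returned H in a dictionary, and pass that dictionary with the mel spectrum of an unknown utterance to `estimate_language`. Extracting the mel spectrum itself is outside this sketch.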
Keywords
Spoken language,Feature (linguistics),Expression (mathematics),Information extraction,Non-negative matrix factorization,Matrix decomposition,Speech recognition,SIGNAL (programming language),Basis (linear algebra),Computer science