POLEMAD: A Database for the Multimodal Analysis of Polish Pronunciation

Speech Communication (2021)

Abstract
This paper describes the structure and functionality of the POLEMAD database, which was built from a study that used an AG 500 electromagnetic articulograph, an acoustic camera, and three video cameras. It also describes the types of data stored in the database: speaker data, electromagnetic articulography (EMA) data, video and sound recordings, phonetic information, and dynamic Bayesian network (DBN) models. The database supports selective extraction of various types of samples for further analysis through SQL queries generated in MATLAB® using Database Toolbox™. Finally, the paper outlines potential future applications of the database in statistical analysis and in the automation of speech inversion experiments that use DBNs.
Key words
Database, Electromagnetic articulography, Video camera, Acoustic camera, Speech inversion, Dynamic Bayesian networks
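The selective extraction mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the POLEMAD schema, and the paper generates its queries in MATLAB with Database Toolbox, so everything below is an assumption made for illustration: the `speakers` and `recordings` tables, their columns, the sample rows, and the helper names are hypothetical, and Python with an in-memory SQLite database stands in for the real setup.

```python
import sqlite3


def build_sample_query(phoneme=None, speaker_sex=None):
    """Build a parameterized SELECT over a hypothetical recordings/speakers schema."""
    sql = (
        "SELECT r.id, s.code, r.phoneme, r.ema_file "
        "FROM recordings AS r JOIN speakers AS s ON s.id = r.speaker_id"
    )
    conditions, params = [], []
    # Each optional filter adds one WHERE condition and one bound parameter.
    if phoneme is not None:
        conditions.append("r.phoneme = ?")
        params.append(phoneme)
    if speaker_sex is not None:
        conditions.append("s.sex = ?")
        params.append(speaker_sex)
    if conditions:
        sql += " WHERE " + " AND ".join(conditions)
    return sql + " ORDER BY r.id", params


def make_demo_db():
    """Create an in-memory database with invented rows (not POLEMAD data)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE speakers (id INTEGER PRIMARY KEY, code TEXT, sex TEXT);
        CREATE TABLE recordings (
            id INTEGER PRIMARY KEY,
            speaker_id INTEGER REFERENCES speakers(id),
            phoneme TEXT,
            ema_file TEXT
        );
        INSERT INTO speakers VALUES (1, 'SP01', 'F'), (2, 'SP02', 'M');
        INSERT INTO recordings VALUES
            (1, 1, 'a', 'sp01_a.pos'),
            (2, 1, 's', 'sp01_s.pos'),
            (3, 2, 'a', 'sp02_a.pos');
        """
    )
    return conn


def fetch_samples(conn, **filters):
    """Run the generated query and return the matching rows."""
    sql, params = build_sample_query(**filters)
    return conn.execute(sql, params).fetchall()
```

Calling `fetch_samples(make_demo_db(), phoneme="a")` would return only the rows recorded for that phoneme. Building the query from optional filters with bound parameters is one common way to generate such queries programmatically; whether POLEMAD does it this way is not stated in the abstract.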