Chrome Extension
WeChat Mini Program
Use on ChatGLM

Mnasr: a free speech corpus for mongolian speech recognition and accompanied baselines

2022 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)(2022)

Cited 0|Views15
No score
Abstract
Thanks to the development of deep learning and the emergence of open source data sets, automatic speech recognition (ASR) has made great strides in mainstream languages such as Chinese and English. However, the research of ASR in Mongolian and other minority languages lags far behind the mainstream, due to low attention and limited open source data sets. To promote the development of new models and new methods for Mongolian ASR, this paper releases the MnASR database which contains 345 hours of Mongolian speech signal and the corresponding transcription. MnASR is the largest publicly available and free Mongolian speech database so far. Speech recognition baselines are made public at the same time. Both the database and the accompanied baselines are free for research purpose.
More
Translated text
Key words
Speech Recognition,Mongolian Dataset,Open Data
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined