RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain

Sangeet Sagar, Mirco Ravanelli, Bernd Kiefer, Ivana Kruijff Korbayova, Josef van Genabith

CoRR (2023)

Abstract
Despite recent advancements in speech recognition, there are still difficulties in accurately transcribing conversational and emotional speech in noisy and reverberant acoustic environments. This poses a particular challenge in the search and rescue (SAR) domain, where transcribing conversations among rescue team members is crucial to support real-time decision-making. The scarcity of speech data and associated background noise in SAR scenarios make it difficult to deploy robust speech recognition systems. To address this issue, we have created and made publicly available a German speech dataset called RescueSpeech. This dataset includes real speech recordings from simulated rescue exercises. Additionally, we have released competitive training recipes and pre-trained models. Our study indicates that the current level of performance achieved by state-of-the-art methods is still far from being acceptable.
Keywords
speech recognition, search and rescue, noise robustness