A study on the waveform-based end-to-end deep convolutional neural network for weakly supervised sound event detection

Seokjin Lee,Minhan Kim, Youngho Jeong

JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA(2020)

引用 2|浏览0
暂无评分
摘要
In this paper, the deep convolutional neural network for sound event detection is studied. Especially, the end-to-end neural network, which generates the detection results from the input audio waveform, is studied for weakly supervised problem that includes weakly-labeled and unlabeled dataset. The proposed system is based on the network structure that consists of deeply-stacked 1-dimensional convolutional neural networks, and enhanced by the skip connection and gating mechanism. Additionally, the proposed system is enhanced by the sound event detection and post processings, and the training step using the mean-teacher model is added to deal with the weakly supervised data. The proposed system was evaluated by the Detection and Classification of Acoustic Scenes and Events (DCASE) 2019 Task 4 dataset, and the result shows that the proposed system has F-1-scores of 54 % (segment-based) and 32 % (event-based).
更多
查看译文
关键词
Sound event detection,Deep convolutional neural network,Weakly supervised training,End-to-end neural network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要