Audio-based event detection in the operating room

Jonas Fuchtmann, Thomas Riedel,Maximilian Berlet,Alissa Jell, Luca Wegener,Lars Wagner, Simone Graf,Dirk Wilhelm, Daniel Ostler-Mildner

International Journal of Computer Assisted Radiology and Surgery(2024)

引用 0|浏览3
暂无评分
摘要
Even though workflow analysis in the operating room has come a long way, current systems are still limited to research. In the quest for a robust, universal setup, hardly any attention has been given to the dimension of audio despite its numerous advantages, such as low costs, location, and sight independence, or little required processing power. We present an approach for audio-based event detection that solely relies on two microphones capturing the sound in the operating room. Therefore, a new data set was created with over 63 h of audio recorded and annotated at the University Hospital rechts der Isar. Sound files were labeled, preprocessed, augmented, and subsequently converted to log-mel-spectrograms that served as a visual input for an event classification using pretrained convolutional neural networks. Comparing multiple architectures, we were able to show that even lightweight models, such as MobileNet, can already provide promising results. Data augmentation additionally improved the classification of 11 defined classes, including inter alia different types of coagulation, operating table movements as well as an idle class. With the newly created audio data set, an overall accuracy of 90
更多
查看译文
关键词
Audio signal classification,Event detection,Workflow analysis,Surgical sound analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要