Laryngoscope8: Laryngeal Image Dataset And Classification Of Laryngeal Disease Based On Attention Mechanism

Pattern Recognition Letters (2021)

Abstract
Laryngeal disease is common worldwide. However, there are currently no public laryngeal image datasets, which hinders the development of automatic classification of laryngeal disease. In this work, we build a new laryngeal image dataset called Laryngoscope8, which comprises 3057 images of 1950 unique individuals; each image has been labeled with one of eight labels (seven pathological labels and one normal label) by professional otolaryngologists. We also propose a laryngeal disease classification method that uses an attention mechanism to obtain the critical area under the supervision of image labels. That is, we first train a CNN model to classify the laryngeal images. If the classification result is correct, the region with a strong response is most likely a critical area. The regions with strong responses are used as training data to train an object localization model that can automatically locate the critical area. Given an image for classification, the trained object localization model is employed to locate the critical area, and the located critical area is then used for image classification. The entire process requires only image-level labels and does not require manual labeling of the critical area. Experimental results show that the proposed method achieves promising performance in laryngeal disease classification. (C) 2021 Elsevier B.V. All rights reserved.
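The step in which strong-response regions become training targets for the localization model can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the classifier's response map has already been resized to the image resolution and is non-negative, and the function names `critical_area_box` and `crop`, as well as the relative threshold of 0.5, are hypothetical choices made only for illustration.

```python
def critical_area_box(response, rel_threshold=0.5):
    """Bound the strong-response region of a 2D response map.

    Assumption: a cell counts as "strong" when its value is at least
    rel_threshold times the map's peak value. The abstract does not
    state how strong responses are selected.
    Returns (top, left, bottom, right) with exclusive bottom/right.
    """
    peak = max(max(row) for row in response)
    cutoff = rel_threshold * peak
    rows = [r for r, row in enumerate(response)
            if any(v >= cutoff for v in row)]
    cols = [c for c in range(len(response[0]))
            if any(row[c] >= cutoff for row in response)]
    return rows[0], cols[0], rows[-1] + 1, cols[-1] + 1


def crop(image, box):
    """Cut the boxed region out of a 2D image stored as a list of rows."""
    top, left, bottom, right = box
    return [row[left:right] for row in image[top:bottom]]


# Toy 8x8 response map with one strong rectangular region.
response = [[0.0] * 8 for _ in range(8)]
for r in range(2, 5):
    for c in range(3, 7):
        response[r][c] = 1.0

# Toy image whose pixel value encodes its position.
image = [[r * 8 + c for c in range(8)] for r in range(8)]

box = critical_area_box(response)   # pseudo-label for the localization model
patch = crop(image, box)            # region passed on for classification
```

In a pipeline of this kind, boxes like `box` would be collected only from correctly classified training images and used as localization targets, so no manual annotation of the critical area is needed.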
Keywords
Laryngeal image dataset, Laryngeal disease classification, Attention mechanism