MSG-CAM: Multi-scale inputs make a better visual interpretation of CNN networks
2023 IEEE International Conference on Multimedia and Expo (ICME), 2023
Abstract
Visualization of deep learning models has been widely studied as an effective means of exploring the decision-making processes within these models. However, current visualization methods suffer from several limitations, such as low-resolution saliency maps and poor visualization of multiple occurrences of the same class. In this paper, we propose MSG-CAM, a visualization technique that improves on the existing Group-CAM method. Our method enlarges the original input image to multiple scales, takes the feature maps and gradients of the last layer of the convolutional neural network for each scale, and fuses them to create masks. Through both qualitative and quantitative analysis, we show that the saliency maps generated by our method are more reasonable and more accurately reflect the internal decision-making processes of the neural network.
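The abstract gives no formulas, so the following is only a minimal sketch of the general idea of fusing gradient-weighted maps obtained at several input scales. It assumes Grad-CAM-style channel weights (the spatial mean of each gradient map), nearest-neighbour upsampling, and simple averaging across scales; it does not reproduce the Group-CAM mask grouping or scoring, and the paper's actual fusion rule may differ. All function names and the toy data are hypothetical.

```python
# Hypothetical sketch: fuse gradient-weighted maps from several input scales.
# Assumptions (not from the paper): Grad-CAM-style weights, nearest-neighbour
# upsampling, and plain averaging as the fusion rule.

def grad_weighted_map(features, grads):
    """Combine K channel maps (each H x W) using the mean gradient per channel."""
    h, w = len(features[0]), len(features[0][0])
    cam = [[0.0] * w for _ in range(h)]
    for fmap, gmap in zip(features, grads):
        weight = sum(sum(row) for row in gmap) / (h * w)
        for i in range(h):
            for j in range(w):
                cam[i][j] += weight * fmap[i][j]
    # ReLU keeps only evidence that supports the target class.
    return [[max(v, 0.0) for v in row] for row in cam]

def upsample_nearest(m, out_h, out_w):
    """Resize a 2D map to out_h x out_w by nearest-neighbour lookup."""
    h, w = len(m), len(m[0])
    return [[m[i * h // out_h][j * w // out_w] for j in range(out_w)]
            for i in range(out_h)]

def normalize(m):
    """Min-max scale a 2D map to [0, 1]; a constant map becomes all zeros."""
    lo = min(min(row) for row in m)
    hi = max(max(row) for row in m)
    if hi == lo:
        return [[0.0 for _ in row] for row in m]
    return [[(v - lo) / (hi - lo) for v in row] for row in m]

def fuse_multiscale(per_scale, out_h, out_w):
    """per_scale: list of (features, grads) pairs, one pair per input scale."""
    maps = [normalize(upsample_nearest(grad_weighted_map(f, g), out_h, out_w))
            for f, g in per_scale]
    fused = [[sum(m[i][j] for m in maps) / len(maps) for j in range(out_w)]
             for i in range(out_h)]
    return normalize(fused)

# Toy data: one channel per scale, a 2x2 map (coarse) and a 4x4 map (fine).
coarse = ([[[1.0, 0.0], [0.0, 0.0]]], [[[1.0, 1.0], [1.0, 1.0]]])
fine_features = [[[1.0, 0.0, 0.0, 0.0],
                  [0.0, 0.0, 0.0, 0.0],
                  [0.0, 0.0, 0.0, 0.0],
                  [0.0, 0.0, 0.0, 0.5]]]
fine_grads = [[[1.0] * 4 for _ in range(4)]]
fused = fuse_multiscale([coarse, (fine_features, fine_grads)], 4, 4)
```

In this toy case the coarse map marks the whole top-left quadrant, while the fine map localizes a single cell and adds a weaker second response. The fused map keeps the agreement at the top-left cell and retains the second response at reduced strength.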
Key words
Interpretability, CAM, CNN