MAPNet: Multi-modal attentive pooling network for RGB-D indoor scene classification.

Pattern Recognition(2019)

引用 22|浏览51
暂无评分
摘要
•Orderless pooling can maintain spatial invariance in local information aggregation for indoor scene classification.•Intra-modality Attentive Pooling mines and pools discriminative local semantic cues in each modality.•Cross-modality Attentive Pooling learns to attend on different modalities in terms of different local cues to fuse the selected discriminative semantic cues across modalities.•The attention weights in the model are interpretable for understanding both scene classification and RGB-D fusion.•State-of-the-art results are achieved on both challenging SUN RGB-D Dataset and NYU Depth V2 Dataset.
更多
查看译文
关键词
Indoor scene classification,Multi-modal fusion,RGB-D,Attentive pooling
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要