谷歌Chrome浏览器插件
订阅小程序
在清言上使用

Bilinear Semi-Tensor Product Attention (BSTPA) model for visual question answering

2020 International Joint Conference on Neural Networks (IJCNN)(2020)

引用 6|浏览7
暂无评分
摘要
We propose a semi-tensor product attention network model as a visual question answering tool for complex interaction over image features. Proposed model performs matrix multiplication of two arbitrary dimensions, which is used to overcome possible dimensional limitations and improve recognition flexibility. In used block-wise operation we preserve spatial and temporal information but reduce the number of parameters by using low-rank pooling scheme. Applied BERT pre-train model is tuned to recognize question features. The proposed model is evaluated on the VQA2.0 dataset. Research results show that our model has good accuracy and easy reconfiguration for future research.
更多
查看译文
关键词
Feature extraction,Visualization,Knowledge discovery,Cognition,Task analysis,Computational modeling,Bit error rate
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要