
A flexible testing environment for visual question answering with performance evaluation.

Neurocomputing(2018)

Abstract
In order to move toward efficient autonomous learning, we must have control over our datasets to test and adaptively train systems for complex problems such as Visual Question Answering (VQA). Thus, we created a testing environment around MNIST images with optional cluttering. Although less complex than publicly available VQA datasets, the new environment generates datasets that decouple answers from questions and incorporate abstract ideas (content, context, and arithmetic) that must be learned. In addition, we analyze the performance of merged CNNs and LSTMs using the environment while exploring different ways to incorporate pretrained object classifiers. We demonstrate the usefulness of our environment as well as provide insight on the limitations of simple architectures and the complexities of different questions.
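
To illustrate what decoupling answers from questions can look like, here is a minimal sketch, not the authors' environment. It assumes a scene is just a left-to-right list of distinct digit labels standing in for MNIST images, and it omits cluttering entirely. The helper names `make_scene` and `ask`, and the question wording, are hypothetical. Each answer is computed from the scene, so the same question text can have different answers for different images.

```python
import random

def make_scene(rng, n_digits=2):
    """Pick distinct digit labels, ordered left to right.

    Hypothetical stand-in for placing MNIST digit images on a canvas.
    """
    return rng.sample(range(10), n_digits)

def ask(scene, kind, rng):
    """Return a (question, answer) pair whose answer depends on the scene."""
    if kind == "content":
        # Content: is a given digit present at all?
        d = rng.randrange(10)
        return f"Is there a {d} in the image?", "yes" if d in scene else "no"
    if kind == "context":
        # Context: relative position; needs at least two digits in the scene.
        i = rng.randrange(1, len(scene))
        return f"Which digit is left of the {scene[i]}?", str(scene[i - 1])
    if kind == "arithmetic":
        # Arithmetic: combine the digit values.
        return "What is the sum of the digits?", str(sum(scene))
    raise ValueError(f"unknown question kind: {kind}")

if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same samples
    scene = make_scene(rng)
    for kind in ("content", "context", "arithmetic"):
        question, answer = ask(scene, kind, rng)
        print(scene, question, answer)
```

Because every answer is derived from the scene labels, a model cannot answer correctly from the question text alone, which is the property the abstract describes.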
Keywords
Data environment, Deep learning, Visual Question Answering