Predicting Human Scanpaths in Visual Question Answering (Supplementary Materials)

semanticscholar(2021)

引用 0|浏览4
暂无评分
摘要
1) We present additional results to investigate the effects of hyperparameters, visual encoder backbones, machine attention mechanisms, and more (Section 2). These results suggest that our method is not only generalizable across multiple tasks, but also flexible to work with different visual encoders and task guidance maps. The results also suggest that our predicted scanpaths can fixate task-relevant objects in both VQA and visual search.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要