Visual Understanding and Narration: A Deeper Understanding and Explanation of Visual Scenes.

arXiv: Computation and Language (2019)

Abstract
We describe the task of Visual Understanding and Narration, in which a robot (or agent) generates text for the images that it collects when navigating its environment, by answering open-ended questions, such as 'what happens, or might have happened, here?'