Improved Speaker and Navigator for Vision-and-Language Navigation

IEEE MultiMedia(2021)

引用 4|浏览8
暂无评分
摘要
Prior works in vision-and-language navigation (VLN) focus on using long short-term memory (LSTM) to carry the flow of information on either the navigation model (navigator) or the instruction generating model (speaker).The outstanding capability of LSTM to process intermodal interactions has been widely verified; however, LSTM neglects the intramodel interactions, leading to negative effect on eit...
更多
查看译文
关键词
Navigation,Visualization,Decoding,Trajectory,Task analysis,Feature extraction,Head
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要