Fine-Grained Video Captioning via Graph-based Multi-Granularity Interaction Learning

IEEE Transactions on Pattern Analysis and Machine Intelligence (2022)

Abstract
Learning to generate continuous, highly detailed linguistic descriptions of multi-subject interactive videos is particularly applicable to automatic narration of team sports. In contrast to traditional video captioning, this task is more challenging because it requires simultaneous modeling of fine-grained individual actions, uncovering of spatio-temporal dependency structures of frequent group interactions, an...
Keywords
Sports, Task analysis, Feature extraction, Linguistics, Games, Three-dimensional displays, Measurement