Twitter100k: A Real-world Dataset for Weakly Supervised Cross-Media Retrieval.

IEEE Transactions on Multimedia(2018)

引用 56|浏览83
暂无评分
摘要
This paper contributes a new large-scale dataset for weakly supervised cross-media retrieval, named Twitter100k. Current datasets, such as Wikipedia, NUS Wide, and Flickr30k, have two major limitations. First, these datasets are lacking in content diversity, i.e., only some predefined classes are covered. Second, texts in these datasets are written in well-organized language, leading to inconsiste...
更多
查看译文
关键词
Internet,Encyclopedias,Electronic publishing,Optical character recognition software,Visualization,Training
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要