Visual Tracking With Fully Convolutional Networks

2015 IEEE International Conference on Computer Vision (ICCV)(2015)

引用 1233|浏览243
暂无评分
摘要
We propose a new approach for general object tracking with fully convolutional neural network. Instead of treating convolutional neural network (CNN) as a black-box feature extractor, we conduct in-depth study on the properties of CNN features offline pre-trained on massive image data and classification task on ImageNet. The discoveries motivate the design of our tracking system. It is found that convolutional layers in different levels characterize the target from different perspectives. A top layer encodes more semantic features and serves as a category detector, while a lower layer carries more discriminative information and can better separate the target from distracters with similar appearance. Both layers are jointly used with a switch mechanism during tracking. It is also found that for a tracking target, only a subset of neurons are relevant. A feature map selection method is developed to remove noisy and irrelevant feature maps, which can reduce computation redundancy and improve tracking accuracy. Extensive evaluation on the widely used tracking benchmark I I shows that the proposed tacker outperforms the state-of-the-art significantly.
更多
查看译文
关键词
visual tracking,convolutional neural network,CNN,black-box feature extractor,image classification,target tracking,semantic feature,category detector,switch mechanism,feature map selection method
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要