A Scalable and Accurate Descriptor for Dynamic Textures Using Bag of System Trees

Pattern Analysis and Machine Intelligence, IEEE Transactions  (2015)

引用 36|浏览53
暂无评分
摘要
The bag-of-systems (BoS) representation is a descriptor of motion in a video, where dynamic texture (DT) codewords represent the typical motion patterns in spatio-temporal patches extracted from the video. The efficacy of the BoS descriptor depends on the richness of the codebook, which depends on the number of codewords in the codebook. However, for even modest sized codebooks, mapping videos onto the codebook results in a heavy computational load. In this paper we propose the BoS Tree, which constructs a bottom-up hierarchy of codewords that enables efficient mapping of videos to the BoS codebook. By leveraging the tree structure to efficiently index the codewords, the BoS Tree allows for fast look-ups in the codebook and enables the practical use of larger, richer codebooks. We demonstrate the effectiveness of BoS Trees on classification of four video datasets, as well as on annotation of a video dataset and a music dataset. Finally, we show that, although the fast look-ups of BoS Tree result in different descriptors than BoS for the same video, the overall distance (and kernel) matrices are highly correlated resulting in similar classification performance.
更多
查看译文
关键词
feature extraction,image classification,image motion analysis,image representation,image texture,matrix algebra,music,trees (mathematics),video coding,bos representation,dt codewords,bag of system trees,bag-of-system representation,distance matrices,dynamic texture codewords,dynamic textures,motion patterns,music dataset annotation,spatio-temporal patch extraction,tree structure,video dataset annotation,video dataset classification,video mapping,bag of systems,dynamic texture recognition,efficient indexing,large codebooks,music annotation,video annotation,vegetation,histograms,indexing,vectors,clustering algorithms
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要