Discriminative human action classification using locality-constrained linear coding

Pattern Recognition Letters(2016)

引用 49|浏览84
暂无评分
摘要
We propose using the locality-constrained linear coding for action classification.Our sequence descriptor includes cell, block, and subsequence descriptors.We use maximum pooling and a logistic regression classifier to encode each sequence.We demonstrate the effectiveness of our algorithm on both depth and RGB videos. We propose a Locality-constrained Linear Coding (LLC) based algorithm that captures discriminative information of human actions in spatio-temporal subsequences of videos. The input video is divided into equally spaced overlapping spatio-temporal subsequences. Each subsequence is further divided into blocks and then cells. The spatio-temporal information in each cell is represented by a Histogram of Oriented 3D Gradients (HOG3D). LLC is then used to encode each block. We show that LLC gives more stable and repetitive codes compared to the standard Sparse Coding. The final representation of a video sequence is obtained using logistic regression with 2 regularization and classification is performed by a linear SVM. The proposed algorithm is applicable to conventional and depth videos. Experimental comparison with ten state-of-the-art methods on three depth video and two conventional video databases shows that the proposed method consistently achieves the best performance.
更多
查看译文
关键词
Human action classification,Locality-constrained linear coding,Sparse coding,SVM classifier
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要