谷歌Chrome浏览器插件
订阅小程序
在清言上使用

Video Action Recognition Based On Improved 3d Convolutional Network And Sparse Representation Classification

2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE(2019)

引用 1|浏览5
暂无评分
摘要
In view of the problem that the typical convolutional neural networks fail to model actions at their full temporal extent, a novel video action recognition algorithm, which is based on improved 3D Convolutional Network (iC3D) architecture with K-means keyframes extraction and sparse representation classification (SRC), is proposed in this study. During the feature extraction process, the K-means keyframes extraction is constrained to reduce redundant information generated by continuous video frames and increase the temporal acceptance region. Meanwhile, to improve the noise immunity, sparse coding and its reconstruction errors are used for classification. The proposed method has 96.5% recognition accuracy on the typical video action classification dataset UCF101 that outperforms other competing methods. In addition, we built a wild test dataset to verify the generalization performance of the proposed model.
更多
查看译文
关键词
video action recognition, improved 3D convolutional networks, keyframes extraction, sparse representation classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要