Robust video summarization algorithm using supervised machine learning

Sunil S Harakannanavar, Shaik Roshan Sameer, Vikash Kumar, Sunil Kumar Behera, Adithya V Amberkar,Veena I. Puranikmath

Global Transitions Proceedings(2022)

Cited 3|Views1
No score
Abstract
The proposed approach uses ResNet-18 for feature extraction and with the help of temporal interest proposals generated for the video sequences, generates a video summary. The ResNet-18 is a convolutional neural network with eighteen layers. The existing methods don't address the problem of the summary being temporally consistent. The proposed work aims to create a temporally consistent summary. The classification and regression module are implemented to get fixed length inputs of the combined features. After this, the non-maximum suppression algorithm is applied to reduce the redundancy and remove the video segments having poor quality and low confidence-scores. Video summaries are generated using the kernel temporal segmentation (KTS) algorithm which converts a given video segment into video shots. The two standard datasets TVSum and SumMe are used to evaluate the proposed model. It is seen that the F-score obtained on TVSum and SumMe dataset is 56.13 and 45.06 respectively.
More
Translated text
Key words
Convolutional Neural Networks,Kernel temporal segmentation,Non maximum Suppression,Redundancy,ResNet-18
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined