Abstract
Combining convolutional neural networks (CNNs) and recurrent neural networks (RNNs) produces a powerful architecture for video classification, as spatial–temporal information can be processed simultaneously and effectively. Using transfer learning, this paper presents a comparative study investigating how temporal information can be exploited to improve video classification performance when CNNs and RNNs are combined in various architectures. To enhance the performance of the identified architecture for effectively combining CNNs and RNNs, a novel action template-based keyframe extraction method is proposed: the informative region of each frame is identified, and keyframes are selected based on the similarity between those regions. Extensive experiments with ConvLSTM-based video classifiers have been conducted on the KTH and UCF-101 datasets. Experimental results, evaluated using one-way analysis of variance, show that the proposed keyframe extraction method significantly improves video classification accuracy.
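The abstract does not spell out how the informative region or the action template is computed, but the general idea of locating a motion region per frame and keeping only frames whose regions differ sufficiently can be sketched as follows. This is a hypothetical NumPy sketch: the frame-differencing heuristic, the normalised-correlation similarity, and the names `informative_region`, `region_similarity`, and `select_keyframes` are illustrative assumptions, not the paper's exact method.

```python
import numpy as np

def informative_region(frame, prev_frame, threshold=25):
    """Crop of the current frame around pixels that changed since the
    previous frame (frame differencing; a stand-in for the paper's
    action template)."""
    diff = np.abs(frame.astype(np.int16) - prev_frame.astype(np.int16))
    ys, xs = np.nonzero(diff > threshold)
    if ys.size == 0:
        return None  # no motion detected in this frame
    return frame[ys.min():ys.max() + 1, xs.min():xs.max() + 1]

def region_similarity(a, b):
    """Normalised correlation between two regions, cropped to their
    common shape (a crude similarity proxy)."""
    h, w = min(a.shape[0], b.shape[0]), min(a.shape[1], b.shape[1])
    a = a[:h, :w].astype(np.float64) - a[:h, :w].mean()
    b = b[:h, :w].astype(np.float64) - b[:h, :w].mean()
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    return float((a * b).sum() / denom) if denom > 0 else 1.0

def select_keyframes(frames, sim_threshold=0.9):
    """Keep a frame only when its informative region is sufficiently
    dissimilar to the region of the last selected keyframe."""
    keyframes, last_region = [], None
    for i in range(1, len(frames)):
        region = informative_region(frames[i], frames[i - 1])
        if region is None:
            continue
        if last_region is None or region_similarity(region, last_region) < sim_threshold:
            keyframes.append(i)
            last_region = region
    return keyframes
```

Selecting keyframes this way shortens each clip to its most informative frames before the ConvLSTM sees it, which is the role the abstract assigns to the proposed method.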
Highlights
Video has become more popular in many applications in recent years due to increased storage capacity, more advanced network architectures, as well as easy access to digital cameras, especially in mobile phones
This paper presents a comparative study to investigate how temporal information can be utilized to improve the performance of video classification when convolutional neural networks (CNNs) and recurrent neural networks (RNNs) are combined in various architectures
To enhance the performance of the identified architecture for effective combination of CNN and RNN, a novel action template-based keyframe extraction method is proposed by identifying the informative region of each frame and selecting keyframes based on the similarity between those regions
Summary
Video has become more popular in many applications in recent years due to increased storage capacity, more advanced network architectures, and easy access to digital cameras, especially in mobile phones. More than 500 h of video are uploaded to the Internet every minute, and this sharp rise in the number of videos is expected to continue in the coming decades due to the growing demand for video content [1]. This rapid growth brings serious challenges for automatic video analysis. Although combining CNNs and RNNs has achieved good results [7, 8], the representation of temporal information is still a demanding problem due to complex variations in actions and dynamic backgrounds in videos.
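The abstract states that classification accuracies were compared with one-way analysis of variance. As a reminder of what that test computes, the F-statistic for k groups of observations (here, accuracies from repeated runs of each configuration) can be written in a few lines of plain Python; this is a generic sketch of the standard test, not the paper's evaluation code.

```python
def one_way_anova(groups):
    """One-way ANOVA F-statistic for k groups of observations.
    A large F suggests the group means differ more than chance allows."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand = sum(sum(g) for g in groups) / n
    # Between-group sum of squares, df = k - 1
    ss_between = sum(len(g) * (sum(g) / len(g) - grand) ** 2 for g in groups)
    # Within-group sum of squares, df = n - k
    ss_within = sum(sum((x - sum(g) / len(g)) ** 2 for x in g) for g in groups)
    return (ss_between / (k - 1)) / (ss_within / (n - k))
```

For example, `one_way_anova([[1, 2, 3], [2, 3, 4]])` returns F = 1.5, and two identical groups give F = 0; in practice one would compare F against the F-distribution (e.g. via `scipy.stats.f_oneway`) to obtain a p-value.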