Deep Architectures and Ensembles for Semantic Video Classification

Eng-Jon Ong,Syed Sameed Husain,Mikel Bober-Irizar,Miroslaw Bober

doi:10.1109/tcsvt.2018.2881842

Eng-Jon Ong, Syed Sameed Husain + Show 2 more

Open Access

https://doi.org/10.1109/tcsvt.2018.2881842

Copy DOI

Abstract

This work addresses the problem of accurate semantic labelling of short videos. To this end, a multitude of different deep nets, ranging from traditional recurrent neural networks (LSTM, GRU), temporal agnostic networks (FV,VLAD,BoW), fully connected neural networks mid-stage AV fusion and others. Additionally, we also propose a residual architecture-based DNN for video classification, with state-of-the art classification performance at significantly reduced complexity. Furthermore, we propose four new approaches to diversity-driven multi-net ensembling, one based on fast correlation measure and three incorporating a DNN-based combiner. We show that significant performance gains can be achieved by ensembling diverse nets and we investigate factors contributing to high diversity. Based on the extensive YouTube8M dataset, we provide an in-depth evaluation and analysis of their behaviour. We show that the performance of the ensemble is state-of-the-art achieving the highest accuracy on the YouTube8M Kaggle test data. The performance of the ensemble of classifiers was also evaluated on the HMDB51 and UCF101 datasets, and show that the resulting method achieves comparable accuracy with state-ofthe- art methods using similar input features.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: Dec 1, 2019
Citations: 14	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Deep Architectures and Ensembles for Semantic Video Classification

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Similar Papers

Short-term Wind Power Prediction Model Based on GSA Optimized GRU Neural Network
Qingyun Xie ... Yongkang He
-
Qingyun Xie, et. al.Qingyun Xie ... Yongkang He
06 Nov 2020
06 Nov 2020

Memory analysis for memristors and memristive recurrent neural networks
Gang Bao ... Yide Zhang
IEEE/CAA Journal of Automatica Sinica | VOL. 7
Gang Bao, et. al.Gang Bao ... Yide Zhang
01 Jan 2020
IEEE/CAA Journal of Automatica Sinica | VOL. 7

Prediction of CO concentration in different conditions based on Gaussian-TCN
Pengfei Jia ... Sen Ni
Sensors and Actuators: B. Chemical | VOL. 376
Pengfei Jia, et. al.Pengfei Jia ... Sen Ni
17 Nov 2022
Sensors and Actuators: B. Chemical | VOL. 376

Sentiment analysis and research based on two‐channel parallel hybrid neural network model with attention mechanism
Yanqiu Sun ... Yan Yan
IET Control Theory & Applications | VOL. 17
Yanqiu Sun, et. al.Yanqiu Sun ... Yan Yan
05 Apr 2023
IET Control Theory & Applications | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Architectures and Ensembles for Semantic Video Classification

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology