Classifying web videos using a global video descriptor

Berkan Solmaz, Mubarak Shah, Shayan Modiri Assari

doi:10.1007/s00138-012-0449-x

Abstract

Computing descriptors for videos is a crucial task in computer vision. In this paper, we propose a global video descriptor for the classification of videos. Our method bypasses the detection of interest points, the extraction of local video descriptors, and the quantization of descriptors into a codebook; it represents each video sequence as a single feature vector. Our global descriptor is computed by applying a bank of 3-D spatio-temporal filters to the frequency spectrum of a video sequence; hence, it integrates information about both motion and scene structure. We tested our approach on three datasets: KTH (Schuldt et al., Proceedings of the 17th International Conference on Pattern Recognition (ICPR'04), vol. 3, pp. 32-36, 2004), UCF50 (http://vision.eecs.ucf.edu/datasetsActions.html), and HMDB51 (Kuehne et al., HMDB: a large video database for human motion recognition, 2011). The promising results demonstrate the robustness and discriminative power of our global video descriptor for classifying videos of various actions. In addition, the combination of our global descriptor and a local descriptor resulted in the highest classification accuracies on the UCF50 and HMDB51 datasets.
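The pipeline summarized in the abstract (take the 3-D frequency spectrum of a clip, apply a bank of spatio-temporal filters, and reduce the result to a single feature vector) can be illustrated with a minimal NumPy sketch. The abstract does not specify the filter family, the filter parameters, or the pooling scheme, so everything concrete here is an assumption made for illustration: the Gaussian band-pass filters, the center frequencies, `sigma`, the averaging grid, and the function names `gaussian_bandpass_bank` and `global_descriptor` are all hypothetical and should not be read as the authors' implementation.

```python
import numpy as np


def gaussian_bandpass_bank(shape, centers, sigma=0.1):
    """Build 3-D Gaussian filters on the FFT frequency grid of a clip.

    `centers` holds (ft, fy, fx) center frequencies in cycles per sample.
    The Gaussian form, the centers and `sigma` are illustrative assumptions.
    """
    ft, fy, fx = np.meshgrid(
        np.fft.fftfreq(shape[0]),
        np.fft.fftfreq(shape[1]),
        np.fft.fftfreq(shape[2]),
        indexing="ij",
    )
    bank = []
    for ct, cy, cx in centers:
        dist2 = (ft - ct) ** 2 + (fy - cy) ** 2 + (fx - cx) ** 2
        bank.append(np.exp(-dist2 / (2.0 * sigma ** 2)))
    return bank


def global_descriptor(video, centers, grid=(2, 2, 2), sigma=0.1):
    """Return a single feature vector for a (T, H, W) grayscale clip."""
    # Frequency spectrum of the whole spatio-temporal volume.
    spectrum = np.fft.fftn(video)
    features = []
    for filt in gaussian_bandpass_bank(video.shape, centers, sigma):
        # Filter in the frequency domain, then return to the video domain.
        response = np.abs(np.fft.ifftn(spectrum * filt))
        # Assumed pooling: average the response over a coarse 3-D grid.
        for t_block in np.array_split(response, grid[0], axis=0):
            for y_block in np.array_split(t_block, grid[1], axis=1):
                for x_block in np.array_split(y_block, grid[2], axis=2):
                    features.append(x_block.mean())
    return np.asarray(features)


# Synthetic clip: 16 frames of 32 x 32 pixels (random data, not a real video).
rng = np.random.default_rng(0)
clip = rng.random((16, 32, 32))
centers = [(0.1, 0.0, 0.0), (0.0, 0.1, 0.0), (0.0, 0.0, 0.1)]
desc = global_descriptor(clip, centers)
print(desc.shape)  # (24,): 3 filters x 8 grid cells
```

Because the vector has a fixed length regardless of clip content, it could be passed directly to a standard classifier without interest-point detection or codebook quantization, which is the property the abstract emphasizes.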

Journal: Machine Vision and Applications
Publication Date: Sep 26, 2012
Citations: 88

Similar Papers

Performance analysis of various local and global shape descriptors for image retrieval
Chandan Singh ... Pooja Sharma
Multimedia Systems | VOL. 19
09 Aug 2012

Leveraging local and global descriptors in parallel to search correspondences for visual localization
Pengju Zhang ... Yihong Wu
Pattern Recognition | VOL. 122
22 Sep 2021

An Efficient Retrieval Technique for Trademarks Based on the Fuzzy Inference System
Chin-Sheng Chen ... Chi-Min Weng
Applied Sciences | VOL. 7
18 Aug 2017

3D ear recognition using global and local features
Iyyakutti Iyappan Ganapathi ... Surya Prakash
IET Biometrics | VOL. 7
19 Mar 2018
