Modeling spatio-temporal layout with Lie Algebrized Gaussians for action recognition

Meng Chen, Tianjiang Wang, Qi Feng, Liyu Gong, Fang Liu

doi:10.1007/s11042-015-3008-4

Abstract

We propose a novel approach to modeling the spatio-temporal distribution of local features for action recognition in videos. The approach builds on Lie Algebrized Gaussians (LAG), a feature aggregation method that yields a high-dimensional video signature. In the LAG framework, local features extracted from a video are aggregated to train a video-specific Gaussian Mixture Model (GMM). The video-specific GMM is then encoded as a vector based on Lie group theory, a step also referred to as GMM vectorization. Because the video-specific GMM gives a soft partition of the feature space, for each cell of that space (i.e., each Gaussian component) we use a GMM to model the spatio-temporal locations of the local features assigned to that component. The location GMMs are encoded as vectors in the same way as the local feature GMM, and we term the resulting vectors spatio-temporal LAG (STLAG). In addition, although LAG and the popular Fisher Vector (FV) are derived from distinct theoretical perspectives, we find that they are closely related; hence the power and l2 normalization proposed for the FV also benefit LAG. Experimental results show that STLAG models spatio-temporal layout very effectively compared with other techniques such as spatio-temporal pyramids and feature augmentation. Using state-of-the-art dense trajectory features, our approach achieves state-of-the-art performance on two challenging datasets: Hollywood2 and HMDB51.
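The power and l2 normalization mentioned in the abstract can be illustrated with a minimal sketch. It assumes the usual Fisher Vector form, a signed power step sign(z)|z|^alpha followed by rescaling to unit Euclidean norm. The function name `power_l2_normalize`, the exponent `alpha=0.5`, and the toy input vector are illustrative assumptions, not values taken from the paper.

```python
import math


def power_l2_normalize(vec, alpha=0.5):
    """Signed power normalization followed by l2 normalization.

    `alpha` is an assumed exponent (0.5 is a common choice for Fisher
    Vectors); the paper's actual setting is not stated in the abstract.
    """
    # Power step: sign(z) * |z|**alpha dampens large-magnitude entries
    # while keeping each entry's sign.
    powered = [math.copysign(abs(z) ** alpha, z) for z in vec]
    # l2 step: rescale to unit Euclidean norm; an all-zero vector is
    # returned unchanged to avoid dividing by zero.
    norm = math.sqrt(sum(p * p for p in powered))
    if norm == 0.0:
        return powered
    return [p / norm for p in powered]


# Toy stand-in for a (much higher-dimensional) LAG or STLAG vector.
signature = [4.0, -9.0, 0.0, 16.0]
normalized = power_l2_normalize(signature)
```

Under these assumptions the same function would be applied to the LAG vector and to each STLAG vector before they are passed to a classifier; whether the paper normalizes them jointly or separately is not specified in the abstract.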

Journal: Multimedia Tools and Applications
Publication Date: Oct 26, 2015
Citations: 40

Similar Papers

Human action recognition in surveillance video of a computer laboratory
Abdul-Lateef Yussiff ... Yong Suet-Peng
01 Aug 2016

Large Margin Dimensionality Reduction for Action Similarity Labeling
...
IEEE Signal Processing Letters | Vol. 21
01 Aug 2014

Integrating Gaussian mixture model and dilated residual network for action recognition in videos
Ming Fang ... Jianwei Zhao
Multimedia Systems | Vol. 26
20 Aug 2020

Understanding action recognition in still images
Deeptha Girish ... Anca Ralescu
01 Jun 2020
