Spatio-Temporal Laplacian Pyramid Coding for Action Recognition

Ling Shao,Dacheng Tao,Xuelong Li,Xiantong Zhen

doi:10.1109/tcyb.2013.2273174

Abstract

We present a novel descriptor, called spatio-temporal Laplacian pyramid coding (STLPC), for holistic representation of human actions. In contrast to sparse representations based on detected local interest points, STLPC regards a video sequence as a whole with spatio-temporal features directly extracted from it, which prevents the loss of information in sparse representations. Through decomposing each sequence into a set of band-pass-filtered components, the proposed pyramid model localizes features residing at different scales, and therefore is able to effectively encode the motion information of actions. To make features further invariant and resistant to distortions as well as noise, a bank of 3-D Gabor filters is applied to each level of the Laplacian pyramid, followed by max pooling within filter bands and over spatio-temporal neighborhoods. Since the convolving and pooling are performed spatio-temporally, the coding model can capture structural and motion information simultaneously and provide an informative representation of actions. The proposed method achieves superb recognition rates on the KTH, the multiview IXMAS, the challenging UCF Sports, and the newly released HMDB51 datasets. It outperforms state of the art methods showing its great potential on action recognition.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Spatio-Temporal Laplacian Pyramid Coding for Action Recognition

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Cybernetics

Lead the way for us

Journal: IEEE Transactions on Cybernetics	Publication Date: Jul 31, 2013
Citations: 213

Similar Papers

A local descriptor based on Laplacian pyramid coding for action recognition
Xiantong ... Ling Shao
Pattern Recognition Letters | VOL. 34
Xiantong , et. al.Xiantong ... Ling Shao
10 Nov 2012
Pattern Recognition Letters | VOL. 34

Embedding Motion and Structure Features for Action Recognition
Xiantong Zhen ... Dacheng Tao
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 23
Xiantong Zhen, et. al.Xiantong Zhen ... Dacheng Tao
01 Jul 2013
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 23

Action recognition by spatio-temporal oriented energies
Xiantong Zhen ... Xuelong Li
Information Sciences | VOL. 281
Xiantong Zhen, et. al.Xiantong Zhen ... Xuelong Li
01 Jun 2014
Information Sciences | VOL. 281

Action representation and recognition through temporal co-occurrence of flow fields and convolutional neural networks
Hatem A Rashwan ... Domenec Puig
Multimedia Tools and Applications | VOL. 79
Hatem A Rashwan, et. al.Hatem A Rashwan ... Domenec Puig
01 Jul 2020
Multimedia Tools and Applications | VOL. 79

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Spatio-Temporal Laplacian Pyramid Coding for Action Recognition

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Cybernetics