Action recognition with spatio-temporal augmented descriptor and fusion method

Lijun Li,Shuling Dai

doi:10.1007/s11042-016-3789-0

Abstract

Action recognition is one of the most popular fields of computer vision, and lots of efforts have been made to improve recognition accuracy. While multiple descriptors are extracted to represent action, the spatio-temporal information is lost. In order to incorporate spatio-temporal information, we propose a novel method called augmented descriptor by adding the information to the original descriptor. As descriptors represent different video features, such as static appearance and motion information, previous methods just concatenate various descriptors. However, we propose a fusion method to boost the recognition accuracy of action recognition. The Multiple Kernel Learning is utilized to fuse different descriptors to get better representation in our fusion method. We also evaluate the contribution of normalization method to recognition accuracy. Our proposed methods are tested on the benchmark datasets, Olympic Sports dataset and HMDB51 dataset. The experimental results show that our approaches outperform the baseline method of improved trajectories and are effective in recognizing various actions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Action recognition with spatio-temporal augmented descriptor and fusion method

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Journal: Multimedia Tools and Applications	Publication Date: Jul 29, 2016
Citations: 7

Similar Papers

Various frameworks for integrating image and video streams for spatiotemporal information learning employing 2D–3D residual networks for human action recognition
Shaimaa Yosry ... Rania R Ziedan
Discover Applied Sciences | VOL. 6
Shaimaa Yosry, et. al.Shaimaa Yosry ... Rania R Ziedan
18 Mar 2024
Discover Applied Sciences | VOL. 6

Histogram of Oriented Gradient-Based Fusion of Features for Human Action Recognition in Action Video Sequences.
Chirag I Patel ... Muhammad Awais
Sensors (Basel, Switzerland) | VOL. 20
Chirag I Patel, et. al.Chirag I Patel ... Muhammad Awais
18 Dec 2020
Sensors (Basel, Switzerland) | VOL. 20

Improving the spatiotemporal fusion accuracy of fractional vegetation cover in agricultural regions by combining vegetation growth models
Guofeng Tao ... Xiaotong Zhang
International Journal of Applied Earth Observation and Geoinformation | VOL. 101
Guofeng Tao, et. al.Guofeng Tao ... Xiaotong Zhang
21 May 2021
International Journal of Applied Earth Observation and Geoinformation | VOL. 101

Shuffle-invariant Network for Action Recognition in Videos
Qinghongya Shi ... Jing-Hua Liu
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 18
Qinghongya Shi, et. al.Qinghongya Shi ... Jing-Hua Liu
04 Mar 2022
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Action recognition with spatio-temporal augmented descriptor and fusion method

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications