Gradient-layer feature transform for action detection and recognition

Liangliang Wang, Ruifeng Li, Yajun Fang

doi:10.1016/j.jvcir.2016.06.023

Abstract

Representing action features across consecutive video frames is a basic but critical problem in computer vision. This paper presents a principled technique that transforms gradient-based features into coherent spatial-temporal descriptors for action detection and recognition. Specifically, a Gaussian-convolution-based technique is first applied to extract the spatial features of each image frame on the gradient layer. These spatial features are then processed by forward-backward frame differencing and by correspondence fusion between frames to represent the frame sequence. Action regions are labeled by thresholding the projections of the difference features in the horizontal and vertical directions, while action types are classified by learning the fused features. We evaluate our approach on samples from the KTH, Weizmann, UCF Sports, and ChangeDetection.NET 2014 datasets; the results demonstrate its applicability and effectiveness.
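The detection steps named in the abstract (gradient-layer features, forward-backward frame difference, thresholded horizontal-vertical projection) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the central-difference gradient (standing in for the Gaussian-convolution step), the elementwise-minimum rule for combining the forward and backward differences, the `ratio` threshold, and all function names below are assumptions made for illustration only.

```python
# Illustrative sketch only; NOT the method from the paper.
# Frames are grayscale images stored as lists of rows of floats.


def gradient_magnitude(frame):
    """Central-difference gradient magnitude per pixel.

    Assumption: a simple stand-in for the paper's Gaussian-convolution-based
    gradient-layer features, whose exact form the abstract does not give.
    """
    h, w = len(frame), len(frame[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Neighbour indices are clamped at the image border.
            gx = frame[y][min(x + 1, w - 1)] - frame[y][max(x - 1, 0)]
            gy = frame[min(y + 1, h - 1)][x] - frame[max(y - 1, 0)][x]
            out[y][x] = (gx * gx + gy * gy) ** 0.5
    return out


def forward_backward_difference(prev_f, cur_f, next_f):
    """Elementwise min of |cur - prev| and |next - cur| (assumed fusion rule)."""
    return [
        [min(abs(c - p), abs(n - c)) for p, c, n in zip(rp, rc, rn)]
        for rp, rc, rn in zip(prev_f, cur_f, next_f)
    ]


def active_span(profile, ratio):
    """First and last index whose value exceeds ratio * max(profile)."""
    peak = max(profile)
    if peak <= 0:
        return None  # no motion energy along this axis
    hits = [i for i, v in enumerate(profile) if v > ratio * peak]
    return hits[0], hits[-1]


def detect_region(prev_frame, cur_frame, next_frame, ratio=0.2):
    """Return (top, bottom, left, right) of the moving region, or None."""
    feats = [gradient_magnitude(f) for f in (prev_frame, cur_frame, next_frame)]
    diff = forward_backward_difference(*feats)
    row_profile = [sum(row) for row in diff]        # one value per image row
    col_profile = [sum(col) for col in zip(*diff)]  # one value per image column
    rows = active_span(row_profile, ratio)
    cols = active_span(col_profile, ratio)
    if rows is None or cols is None:
        return None
    return rows[0], rows[1], cols[0], cols[1]
```

Calling `detect_region` on three consecutive frames returns a bounding box around pixels whose gradient features differ from both the previous and the next frame, or `None` when the frames are identical. The classification stage described in the abstract (learning the fused features) is not sketched here, since the abstract does not say which learner is used.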

Journal: Journal of Visual Communication and Image Representation
Publication Date: Jun 23, 2016
Citations: 5

Similar Papers

Action Recognition Based on Spatial Temporal Graph Convolutional Networks
Wanqiang Zheng ... Punan Jing
22 Oct 2019

Refined Feature-based Multi-frame and Multi-scale Fusing Gate network for accurate segmentation of plaques in ultrasound videos
Xifeng Hu ... Shuo Li
Computers in Biology and Medicine | VOL. 163
07 Jun 2023

MSST-ResNet: Deep multi-scale spatiotemporal features for robust visual object tracking
Bing Liu ... Yong Yang
Knowledge-Based Systems | VOL. 164
09 Nov 2018

Human Action Segmentation Based on a Streaming Uniform Entropy Slice Method
Cheng Peng ... Jie Huang
IEEE Access | VOL. 6
01 Jan 2018
