Two-stream temporal enhanced Fisher vector encoding for skeleton-based action recognition

Jun Tang,Baodi Liu,Wenhui Guo,Yanjiang Wang

doi:10.1007/s40747-022-00914-3

Abstract

The key to skeleton-based action recognition is how to extract discriminative features from skeleton data. Recently, graph convolutional networks (GCNs) are proven to be highly successful for skeleton-based action recognition. However, existing GCN-based methods focus on extracting robust features while neglecting the information of feature distributions. In this work, we aim to introduce Fisher vector (FV) encoding into GCN to effectively utilize the information of feature distributions. However, since the Gaussian Mixture Model (GMM) is employed to fit the global distribution of features, Fisher vector encoding inevitably leads to losing temporal information of actions, which is demonstrated by our analysis. To tackle this problem, we propose a temporal enhanced Fisher vector encoding algorithm (TEFV) to provide more discriminative visual representation. Compared with FV, our TEFV model can not only preserve the temporal information of the entire action but also capture fine-grained spatial configurations and temporal dynamics. Moreover, we propose a two-stream framework (2sTEFV-GCN) by combining the TEFV model with the GCN model to further improve the performance. On two large-scale datasets for skeleton-based action recognition, NTU-RGB+D 60 and NTU-RGB+D 120, our model achieves state-of-the-art performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Complex & Intelligent Systems	Publication Date: Nov 25, 2022
Citations: 1	License type: open-access

R Discovery Prime

R Discovery Prime

Two-stream temporal enhanced Fisher vector encoding for skeleton-based action recognition

Abstract

Talk to us

Similar Papers

More From: Complex & Intelligent Systems

Lead the way for us

Similar Papers

Effective Skeleton-Based Action Recognition by combining Graph Convolutional Networks and Fisher Vector Encoding
Jun Tang ... Baodi Liu
-
Jun Tang, et. al.Jun Tang ... Baodi Liu
06 Dec 2020
06 Dec 2020

A graph convolutional neural network model with Fisher vector encoding and channel‐wise spatial‐temporal aggregation for skeleton‐based action recognition
Jun Tang ... Weifeng Liu
IET Image Processing | VOL. 16
Jun Tang, et. al.Jun Tang ... Weifeng Liu
17 Jan 2022
IET Image Processing | VOL. 16

Skeleton-based Action Recognition with Multi-scale Spatial-temporal Convolutional Neural Network
Qin Cheng ... Qieshi Zhang
-
Qin Cheng, et. al.Qin Cheng ... Qieshi Zhang
15 Jul 2021
15 Jul 2021

An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition
Chenyang Si ... Tieniu Tan
-
Chenyang Si, et. al.Chenyang Si ... Tieniu Tan
01 Jun 2019
01 Jun 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two-stream temporal enhanced Fisher vector encoding for skeleton-based action recognition

Abstract

Talk to us

Similar Papers

More From: Complex & Intelligent Systems