Abstract
Human action recognition is important for many applications, such as surveillance, safety, and healthcare. Because 3D body skeletons characterize body actions accurately and are robust to changes in camera view, we propose a 3D skeleton-based human action recognition method. Unlike existing skeleton-based methods that rely only on geometric features, we propose a physics-augmented encoder-decoder model that produces physically plausible geometric features for human action recognition. Specifically, given an input skeleton sequence, the encoder performs spatiotemporal graph convolution to produce spatiotemporal features that are used both to predict human actions and to estimate the generalized positions and forces of the body joints. The decoder, implemented as an ODE solver, takes the joint forces and solves the Euler-Lagrange equation to reconstruct the skeletons in the next frame. By training the model to simultaneously minimize the action classification error and the 3D skeleton reconstruction error, the encoder is driven to produce features that are discriminative and consistent with both the body skeletons and the underlying body dynamics. These physics-augmented spatiotemporal features are then used for human action classification. We evaluate the proposed method on NTU-RGB+D, a large-scale dataset for skeleton-based action recognition. Compared with existing methods, our method achieves higher accuracy and better generalization ability.
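For reference, and using notation assumed here rather than taken from the paper, the body dynamics the decoder integrates can be written as the Euler-Lagrange equation with generalized positions $q$, Lagrangian $\mathcal{L}(q,\dot{q})$, and generalized joint forces $\tau$,
\[
\frac{d}{dt}\frac{\partial \mathcal{L}}{\partial \dot{q}} - \frac{\partial \mathcal{L}}{\partial q} = \tau ,
\]
and the joint training described above corresponds to minimizing a combined objective of the form $L_{\text{total}} = L_{\text{cls}} + \lambda\, L_{\text{rec}}$, where $L_{\text{cls}}$ is the action classification loss, $L_{\text{rec}}$ the 3D skeleton reconstruction error, and $\lambda$ a weighting hyperparameter; the specific loss terms and weighting are a sketch, not the paper's exact formulation.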