Explore human parsing modality for action recognition

Jinfu Liu, Shen Zhao, Fang‐Lue Zhang, Mengyuan Liu, Yuhang Wen, Runwei Ding, Fanyang Meng, Nan Dai

doi:10.1049/cit2.12366

Abstract

Multimodal action recognition methods have achieved high success using the pose and RGB modalities. However, skeleton sequences lack appearance depiction, and RGB images suffer from irrelevant noise due to modality limitations. To address this, the authors introduce the human parsing feature map as a novel modality, since it can selectively retain effective semantic features of the body parts while filtering out most irrelevant noise. The authors propose a new dual‐branch framework called ensemble human parsing and pose network (EPP‐Net), which is the first to leverage both skeleton and human parsing modalities for action recognition. The first branch, the human pose branch, feeds robust skeletons into a graph convolutional network to model pose features, while the second branch, the human parsing branch, feeds depictive parsing feature maps into convolutional backbones to model parsing features. The two high‐level features are then combined through a late fusion strategy for better action recognition. Extensive experiments on the NTU RGB+D and NTU RGB+D 120 benchmarks consistently verify the effectiveness of the proposed EPP‐Net, which outperforms the existing action recognition methods. The code is available at https://github.com/liujf69/EPP-Net-Action.
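The abstract does not spell out how the late fusion is computed. As a rough illustration only, the sketch below assumes that each branch emits one score per action class and that the two score vectors are combined by a weighted sum of softmax probabilities. The function names and the `alpha` weight are hypothetical and are not taken from the paper or its repository.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def late_fusion(pose_logits, parsing_logits, alpha=0.5):
    """Fuse two branches at the score level.

    alpha is a hypothetical weight for the pose branch; the parsing
    branch receives (1 - alpha). This is an assumed fusion rule, not
    necessarily the one used by EPP-Net.
    """
    if len(pose_logits) != len(parsing_logits):
        raise ValueError("both branches must score the same set of classes")
    pose_probs = softmax(pose_logits)
    parsing_probs = softmax(parsing_logits)
    fused = [alpha * p + (1.0 - alpha) * q
             for p, q in zip(pose_probs, parsing_probs)]
    # The predicted action is the class with the highest fused score.
    predicted = max(range(len(fused)), key=fused.__getitem__)
    return fused, predicted


# Toy scores for three action classes from each branch.
fused_scores, predicted_class = late_fusion([2.0, 1.0, 0.1], [0.1, 3.0, 0.2])
```

Because fusion happens only on the final scores, each branch can in principle be trained and tuned separately, which is the usual motivation for a late fusion design.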

Journal: CAAI Transactions on Intelligence Technology
Publication Date: Aug 16, 2024
Citations: 2
License type: CC BY 4.0

Similar Papers

Graph Convolutional Neural Network for Human Action Recognition: A Comprehensive Survey
Tasweer Ahmad ... Lianwen Jin
IEEE Transactions on Artificial Intelligence | VOL. 2
01 Apr 2021

An efficient and lightweight multiperson activity recognition framework for robot-assisted healthcare applications
Syed Hammad Hussain Shah ... Ibrahim A Hameed
Expert Systems with Applications | VOL. 241
22 Nov 2023

A Novel Two-Stream Transformer-Based Framework for Multi-Modality Human Action Recognition
Jing Shi ... Liangyin Chen
Applied Sciences | VOL. 13
05 Feb 2023

Action recognition based on RGB and skeleton data sets: A survey
Rujing Yue ... Shaoyi Du
Neurocomputing | VOL. 512
20 Sep 2022
