Abstract

In this work, we propose a strong two-stream baseline method, referred to as GeometryMotion-Net, for 3D action recognition. For efficiency, we first represent each point cloud sequence as a limited number of randomly sampled frames, each consisting of a sparse set of points. We then propose a new two-stream framework for effective 3D action recognition. In the geometry stream, we propose a new module that produces a virtual overall geometry point cloud by first merging all 3D points from the selected frames and then exploiting the local neighborhood information of each point in the feature space. In the motion stream, for any two neighboring point cloud frames, we propose a new module that generates one virtual forward motion point cloud and one virtual backward motion point cloud. Specifically, for each point in the current frame, we first produce a set of 3D offset features relative to its neighboring points in the reference frame (i.e., the previous/subsequent frame) and then exploit the local neighborhood information of this point in the offset feature space. Based on the newly generated virtual overall geometry point cloud and the multiple virtual forward/backward motion point clouds, any existing point cloud analysis method (e.g., PointNet) can be readily adopted to extract discriminative geometry and bidirectional motion features in the geometry and motion streams, respectively, which are then aggregated so that our two-stream network can be trained in an end-to-end fashion. Comprehensive experiments on both large-scale datasets (i.e., NTU RGB+D 60 and NTU RGB+D 120) and small-scale datasets (i.e., N-UCLA and UWA3DII) demonstrate the effectiveness and efficiency of our two-stream network for 3D action recognition.
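To illustrate the motion-stream construction described above, the following is a minimal sketch (not the authors' implementation) of how per-point 3D offset features relative to a neighboring frame could be computed. The function name `knn_offsets`, the neighborhood size k = 16, and the 512 sampled points per frame are assumptions chosen for the example, not settings from the paper.

```python
import numpy as np

def knn_offsets(current, reference, k=16):
    """For each point in `current`, find its k nearest neighbors in
    `reference` and return the 3D offset vectors (neighbor - point).

    current:   (N, 3) points of the current frame
    reference: (M, 3) points of the previous/subsequent frame
    returns:   (N, k, 3) offset features
    """
    # pairwise squared distances between current and reference points
    d2 = ((current[:, None, :] - reference[None, :, :]) ** 2).sum(-1)  # (N, M)
    idx = np.argsort(d2, axis=1)[:, :k]                                # (N, k)
    neighbors = reference[idx]                                         # (N, k, 3)
    return neighbors - current[:, None, :]                             # offsets

# Toy usage: two consecutive frames with 512 randomly sampled points each
frame_t   = np.random.rand(512, 3).astype(np.float32)
frame_tm1 = np.random.rand(512, 3).astype(np.float32)

backward_motion = knn_offsets(frame_t, frame_tm1)   # offsets toward the previous frame
forward_motion  = knn_offsets(frame_tm1, frame_t)   # offsets toward the subsequent frame
print(backward_motion.shape)  # (512, 16, 3)
```

In the full method, these offset features would be further processed (e.g., by exploiting the local neighborhood of each point in the offset feature space) before being fed to a point cloud backbone such as PointNet; this sketch only shows the frame-to-frame offset step.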

