Two-stream Network Research Articles

The visible-infrared pedestrian re-identification (VI Re-ID) task aims to match cross-modality pedestrian images that share the same identity label. Most current methods focus on mitigating the modality discrepancy by adopting a two-stream network and identity supervision. Building on these methods, we propose a novel feature fusion and center aggregation learning network ($F^{2}$CALNet) for cross-modality pedestrian re-identification. $F^{2}$CALNet focuses on learning modality-irrelevant features by simultaneously reducing inter-modality discrepancies and increasing inter-identity variations in a single framework. Specifically, we first adopt a two-stream backbone network to extract modality-independent and modality-shared information. Then, we embed modality mitigation modules in the two-stream network to learn feature maps that are stripped of the modality information. Finally, we devise a feature fusion and center aggregation learning module. This module first merges features of two different granularities to learn distinguishing features. It then applies two kinds of center-based loss functions that reduce the intra-identity inter- and intra-modality differences and increase inter-identity variations by simultaneously pulling the features of the same identity close to their centers and pushing the centers of different identities far apart. Extensive experiments on two public cross-modality datasets (SYSU-MM01 and RegDB) show that $F^{2}$CALNet is superior to the state-of-the-art approaches. Furthermore, on the SYSU-MM01 dataset, our model outperforms the baseline by 5.52% in rank-1 accuracy and 4.25% in mAP.
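The pull/push idea behind center-based losses can be illustrated with a minimal NumPy sketch. This is not the loss defined in the paper: the function name `center_losses`, the hinge form of the push term, and the `margin` parameter are assumptions chosen only to show the general mechanism of pulling features toward their identity center and pushing different centers apart.

```python
import numpy as np

def center_losses(features, labels, margin=1.0):
    """Return (pull, push) terms of a generic center-based loss.

    Hypothetical illustration; the hinge form and `margin` are assumptions.
    """
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    ids = np.unique(labels)  # sorted unique identity labels
    # One center per identity: the mean of all its features, whatever the modality.
    centers = np.stack([features[labels == i].mean(axis=0) for i in ids])
    # Pull term: mean squared distance from each feature to its own center.
    index = np.searchsorted(ids, labels)
    pull = np.mean(np.sum((features - centers[index]) ** 2, axis=1))
    # Push term: hinge penalty on pairs of distinct centers closer than the margin.
    if len(ids) > 1:
        diffs = centers[:, None, :] - centers[None, :, :]
        dists = np.sqrt(np.sum(diffs ** 2, axis=-1))
        off_diag = ~np.eye(len(ids), dtype=bool)
        push = np.mean(np.maximum(0.0, margin - dists[off_diag]))
    else:
        push = 0.0
    return float(pull), float(push)
```

In a training setup the two terms would typically be weighted and added to the identity loss; the weights and the exact distance measure are design choices that this sketch does not take from the paper.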

In this work, we propose a strong two-stream baseline method, referred to as GeometryMotion-Net, for 3D action recognition. For efficiency, we first represent each point cloud sequence as a limited number of randomly sampled frames, with each frame consisting of a sparse set of points. We then propose a new two-stream framework for effective 3D action recognition. In the geometry stream, we propose a new module that produces a virtual overall geometry point cloud by first merging all 3D points from the selected frames and then exploiting the local neighborhood information of each point in the feature space. In the motion stream, for any two neighboring point cloud frames, we propose a new module that generates one virtual forward motion point cloud and one virtual backward motion point cloud. Specifically, for each point in the current frame, we first produce a set of 3D offset features relative to the neighboring points in the reference frame (i.e., the previous/subsequent frame) and then exploit the local neighborhood information of this point in the offset feature space. Based on the newly generated virtual overall geometry point cloud and the multiple virtual forward/backward motion point clouds, any existing point cloud analysis method (e.g., PointNet) can be readily adopted to extract discriminant geometry and bidirectional motion features in the geometry and motion streams, respectively. These features are further aggregated so that our two-stream network is trainable in an end-to-end fashion. Comprehensive experiments on both large-scale datasets (i.e., NTU RGB+D 60 and NTU RGB+D 120) and small-scale datasets (i.e., N-UCLA and UWA3DII) demonstrate the effectiveness and efficiency of our two-stream network for 3D action recognition.
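The offset step in the motion stream can be sketched as follows. This is a simplified illustration under stated assumptions, not the module from the paper: the function name `knn_offsets`, the use of k nearest neighbors in raw 3D coordinates, and the parameter `k` are hypothetical, and the learned feature-space processing described in the abstract is omitted.

```python
import numpy as np

def knn_offsets(current, reference, k=2):
    """For each point in `current`, return 3D offsets to its k nearest
    points in `reference`. Output shape: (N, k, 3).

    Hypothetical helper; neighbor selection by Euclidean k-NN is an assumption.
    """
    current = np.asarray(current, dtype=float)      # (N, 3)
    reference = np.asarray(reference, dtype=float)  # (M, 3)
    # Pairwise squared distances between current and reference points: (N, M).
    d2 = np.sum((current[:, None, :] - reference[None, :, :]) ** 2, axis=-1)
    # Indices of the k closest reference points for each current point: (N, k).
    idx = np.argsort(d2, axis=1)[:, :k]
    # Offsets point from each current point to its selected neighbors.
    return reference[idx] - current[:, None, :]
```

Calling the function once with the previous frame and once with the subsequent frame as `reference` gives the two sets of offsets from which the two directional motion point clouds would be built.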

Articles published on Two-stream Network

Feature Fusion and Center Aggregation for Visible-Infrared Person Re-Identification

A Two-Stream Network Using Global Feature Pyramid and Spatial Attention Block for Human Action Recognition

Unsupervised Single-Scene Semantic Segmentation for Earth Observation

Satellite Video Scene Classification Using Low-Rank Sparse Representation Two-Stream Networks

Spatial–Temporal Gray-Level Co-Occurrence Aware CNN for SAR Image Change Detection

Motion Enhanced Model Based on High-Level Spatial Features

TANet: An Unsupervised Two-Stream Autoencoder Network for Hyperspectral Unmixing

Two-stream attention network with local and non-local dependence for referring relationships

Recognize highly similar sewing gestures by the robot

An Efficient Dual Spatial–Spectral Fusion Network

Feature Alignment for Robust Acoustic Scene Classification Across Devices

Visible-Infrared Person Re-Identification via Partially Interactive Collaboration

Contrastive Proposal Extension With LSTM Network for Weakly Supervised Object Detection

Learning Meta Pattern for Face Anti-Spoofing

Spectral Variability Augmented Two-Stream Network for Hyperspectral Sparse Unmixing

Document images forgery localization using a two‐stream network

Two-stream encoder–decoder network for localizing image forgeries

An Efficient Human Instance-Guided Framework for Video Action Recognition

Two-stream network with phase map for few-shot classification

GeometryMotion-Net: A Strong Two-Stream Baseline for 3D Action Recognition
