Depth Stream Research Articles

Person re-identification (reID) is a crucial aspect of intelligent surveillance systems, enabling the recognition of individuals across non-overlapping camera views. As compared to the RGB modality, RGB-D based reID has the potential to achieve robust and high performance by leveraging rich complementary features of both modalities, making it applicable in various occluded scenarios. However, current multimodal reID approaches often rely on late fusion or feature-level fusion techniques to combine multiple modalities, which limits their ability to proficiently exploit complementary visual and depth-related semantic information and capture complex interactions between unimodal features. To address these limitations, this paper introduces a cross-modality online transformer (CMOT) for RGB-D based person reID with online learning capabilities, which effectively utilizes both RGB and depth modalities for the extraction of spatio-temporal features and fuses across modalities. Our CMOT is composed of three main components: (1) a hypothesis generation module based on a person detector and tracker, (2) dual-stream feature extractors via convolutional neural networks (CNNs), (3) and a fusion transformer based on a self-attention-driven self-attentive modality refinement module (SAMR) and a cross-attention-driven cross-attentive modality interaction module (CAMI) to refine and fuse RGB-D complementary features extracted from the dual-stream, RGB and depth stream, CNNs. Additionally, we introduce a bottleneck enhancement feed-forward block to enhance the model’s representation capability within SAMR and CAMI, significantly reducing parameters and computations compared to the traditional feed-forward network. Moreover, we design the triplet loss function with distance measuring ability for incorporating online learning and finally CMOT works as a few-shot network for reID. Experimental results on three RGB-D person re-identification datasets, namely BIWI RGBD-ID, RobotPKU RGBD-ID, and TVPR2, demonstrate the effectiveness and robustness of CMOT.

Read full abstract

AbstractStream fish survival and recruitment are products of a physicochemical environment that affects growth and provides refuge; yet, the drivers of spatiotemporal variation in juvenile fish abundance remain unclear. Understanding how physicochemical conditions drive spatial and temporal patterns in fish abundances provides insight into how conditions across stream networks influence fish population success, thereby providing direction to managers about the types and locations of conservation actions that would be most beneficial. Using snorkel and habitat surveys of 120 sites sampled from 2015 to 2017, we evaluated the multiscale relationships among physicochemical features, hydrology, and age‐0 Smallmouth Bass (Micropterus dolomieu velox) abundance in relation to network spatial position. Abundance of age‐0 bass was spatiotemporally variable in relation to a July streamflow–network position interaction, a pool depth–stream size interaction, and a stream temperature–network position interaction. High flows at the end of the nesting season were related to lower age‐0 abundance, but this effect was dampened in stream reaches in close proximity to larger mainstems. In small streams, reaches with deeper pool habitat supported higher age‐0 bass abundances, but this trend was not apparent in larger tributaries and mainstem systems. Generally, colder streams had lower age‐0 Smallmouth Bass abundance, though this relationship was not apparent in reaches adjacent to larger streams that generally supported higher age‐0 bass abundances. Conservation actions that (1) facilitate habitat connectivity within and among streams, (2) limit future anthropogenic practices that alter natural geomorphology by creating shallower stream channels, and (3) maintain adequate flow magnitude and timing to support channel complexity (e.g., deeper pools within smaller catchments) would be most beneficial to supporting rearing habitat for age‐0 riverine Smallmouth Bass.

Read full abstract

Depth Stream Research Articles

Related Topics

Articles published on Depth Stream

3-D Convolutional Neural Networks for RGB-D Salient Object Detection and Beyond.

CUSCO: An Unobtrusive Custom Secure Audio-Visual Recording System for Ambient Assisted Living.

ECW-EGNet: Exploring Cross-ModalWeighting and edge-guided decoder network for RGB-D salient object detection

CMOT: A cross-modality transformer for RGB-D fusion in person re-identification with online learning capabilities

Depth cue enhancement and guidance network for RGB-D salient object detection

Evaluation of Hand Washing Procedure Using Vision-Based Frame Level and Spatio-Temporal Level Data Models

Light field angular super-resolution based on intrinsic and geometric information

FASFLNet: feature adaptive selection and fusion lightweight network for RGB-D indoor scene parsing.

SSEC: Semantic Segmentation and Ensemble Classification Framework for Static Hand Gesture Recognition using RGB-D Data

Encoder deep interleaved network with multi-scale aggregation for RGB-D salient object detection

When CNNs meet random RNNs: Towards multi-level analysis for RGB-D object and scene recognition

DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection.

Single Depth View Based Real-Time Reconstruction of Hand-Object Interactions

Sign language recognition using Kinect sensor based on color stream and skeleton points

RGB-D Salient Object Detection via 3D Convolutional Neural Networks

CDNet: Complementary Depth Network for RGB-D Salient Object Detection.

Joint learning of convolution neural networks for RGB‐D‐based human action recognition

Age‐0 Smallmouth Bass abundance depends on physicochemical conditions and stream network position

TSNet: Three-Stream Self-Attention Network for RGB-D Indoor Semantic Segmentation

Multi-level progressive parallel attention guided salient object detection for RGB-D images

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Depth Stream Research Articles

Related Topics

Articles published on Depth Stream

3-D Convolutional Neural Networks for RGB-D Salient Object Detection and Beyond.

CUSCO: An Unobtrusive Custom Secure Audio-Visual Recording System for Ambient Assisted Living.

ECW-EGNet: Exploring Cross-ModalWeighting and edge-guided decoder network for RGB-D salient object detection

CMOT: A cross-modality transformer for RGB-D fusion in person re-identification with online learning capabilities

Depth cue enhancement and guidance network for RGB-D salient object detection

Evaluation of Hand Washing Procedure Using Vision-Based Frame Level and Spatio-Temporal Level Data Models

Light field angular super-resolution based on intrinsic and geometric information

FASFLNet: feature adaptive selection and fusion lightweight network for RGB-D indoor scene parsing.

SSEC: Semantic Segmentation and Ensemble Classification Framework for Static Hand Gesture Recognition using RGB-D Data

Encoder deep interleaved network with multi-scale aggregation for RGB-D salient object detection

When CNNs meet random RNNs: Towards multi-level analysis for RGB-D object and scene recognition

DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection.

Single Depth View Based Real-Time Reconstruction of Hand-Object Interactions

Sign language recognition using Kinect sensor based on color stream and skeleton points

RGB-D Salient Object Detection via 3D Convolutional Neural Networks

CDNet: Complementary Depth Network for RGB-D Salient Object Detection.

Joint learning of convolution neural networks for RGB‐D‐based human action recognition

Age‐0 Smallmouth Bass abundance depends on physicochemical conditions and stream network position

TSNet: Three-Stream Self-Attention Network for RGB-D Indoor Semantic Segmentation

Multi-level progressive parallel attention guided salient object detection for RGB-D images