Gait disabilities are among the most frequent impairments worldwide. Their treatment increasingly relies on rehabilitation therapies in which smart walkers are introduced to support the user's recovery and autonomy while reducing the clinicians' effort. To do so, these devices must decode human motion and needs as early as possible. Current walkers decode motion intention from wearable or embedded sensors, namely inertial measurement units, force sensors, Hall sensors, and lasers, whose main limitations either make the solution expensive or hinder the perception of human movement. Smart walkers also commonly lack an advanced, seamless human–robot interaction that understands human motion intuitively and promptly. This work proposes a contactless approach that frames human motion decoding as an early action recognition/detection problem using RGB-D cameras. We studied different deep learning-based algorithms, organised into three approaches, to process lower-body RGB-D video sequences recorded by a camera embedded in a smart walker and classify them into four classes (stop, walk, turn right, and turn left). A custom dataset of 15 healthy participants walking with the device was acquired and prepared, yielding 28,800 class-balanced RGB-D frames for training and evaluating the deep learning networks. The best results were attained by a convolutional neural network with a channel-wise attention mechanism, reaching accuracies of 99.61% for offline early detection/recognition and above 93% in trial simulations. Following the hypothesis that lower-body features encode prominent information and foster more robust predictions for real-time applications, the focus of the algorithms was also evaluated quantitatively with the Dice metric, yielding values slightly above 30%. Overall, early action detection proved a promising human motion decoding strategy, with enhancements in the focus of the proposed architectures.
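The abstract names a channel-wise attention mechanism but not its implementation. As a point of reference, the sketch below shows one common instance of channel-wise attention, a squeeze-and-excitation-style block in PyTorch; the class name, reduction ratio, and tensor sizes are illustrative assumptions, not the paper's architecture.

```python
# Minimal sketch of channel-wise attention (squeeze-and-excitation style).
# Hyperparameters (reduction ratio, feature sizes) are assumptions for
# illustration only, not the paper's reported configuration.
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)      # squeeze: global spatial average
        self.fc = nn.Sequential(                 # excitation: per-channel gates
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        w = self.pool(x).view(b, c)              # (B, C) channel descriptors
        w = self.fc(w).view(b, c, 1, 1)          # channel weights in [0, 1]
        return x * w                             # reweight the feature maps

# Example: a 4-class head (stop, walk, turn right, turn left) over
# attended feature maps, e.g. from RGB-D inputs.
features = torch.randn(8, 64, 56, 56)
attended = ChannelAttention(64)(features)
logits = nn.Linear(64, 4)(attended.mean(dim=(2, 3)))   # (8, 4)
```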
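The Dice-based focus evaluation is likewise described only at a high level. One plausible reading is an overlap score between a binarised model saliency map (e.g. from Grad-CAM) and a lower-body ground-truth mask; the threshold and random inputs below are assumptions for demonstration.

```python
# Hedged sketch: Dice overlap between a binarised saliency map and a
# lower-body mask. The 0.5 threshold and toy inputs are assumptions.
import numpy as np

def dice(saliency: np.ndarray, mask: np.ndarray, thr: float = 0.5) -> float:
    """Dice = 2|A ∩ B| / (|A| + |B|) for binary maps A and B."""
    a = saliency >= thr
    b = mask.astype(bool)
    denom = a.sum() + b.sum()
    return 2.0 * np.logical_and(a, b).sum() / denom if denom else 1.0

# Toy usage; real inputs would be a network saliency map and a
# lower-body segmentation mask of the same spatial shape.
rng = np.random.default_rng(0)
print(dice(rng.random((224, 224)), rng.random((224, 224)) > 0.5))
```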