Two-stream Convolutional Neural Network Research Articles

The reliability of automated image interpretation of point-of-care (POC) echocardiography scans depends on the quality of the acquired ultrasound data. This work reports on the development and validation of spatiotemporal deep learning models that assess whether ultrasound cine loops collected with a handheld echocardiography device are suitable for processing by an automated quantification algorithm (e.g., ejection fraction (EF) estimation). POC echocardiograms (n = 885 DICOM cine loops from 175 patients) from two sites were collected using a handheld ultrasound device and annotated for image quality at the frame level. Attributes of high-quality frames for left ventricular (LV) quantification included a temporally stable LV, reasonable coverage of LV borders, and good contrast between the borders and chamber. Attributes of low-quality frames included temporal instability of the LV and/or imaging artifacts (e.g., lack of contrast, haze, reverberation, acoustic shadowing). Three different neural network architectures were investigated: (a) a frame-level convolutional neural network (CNN) which operates on individual echo frames (VectorCNN), (b) a single-stream sequence-level CNN which operates on a sequence of echo frames (VectorCNN+LSTM), and (c) two-stream sequence-level CNNs which operate on a sequence of echo and optical flow frames (VectorCNN+LSTM+Average, VectorCNN+LSTM+MinMax, and VectorCNN+LSTM+ConvPool). Evaluation on a sequestered test dataset containing 76 DICOM cine loops with 16,914 frames showed that VectorCNN+LSTM can effectively utilize both spatial and temporal information to regress the quality of an input frame (accuracy = 0.925, sensitivity = 0.860, specificity = 0.952), compared to the frame-level VectorCNN that only utilizes spatial information in that frame (accuracy = 0.903, sensitivity = 0.791, specificity = 0.949).
Furthermore, an independent-samples t-test indicated that the cine loops classified as adequate quality by the VectorCNN+LSTM model had a statistically significantly lower bias in the automatically estimated EF (mean bias = -3.73 ± 7.46%, versus a clinically obtained reference EF) compared to the loops classified as inadequate (mean bias = -15.92 ± 12.17%) (p = 0.007). Thus, cine loop stratification using the proposed spatiotemporal CNN model improves the reliability of automated point-of-care echocardiography image interpretation.
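The accuracy, sensitivity, and specificity figures quoted above are standard binary classification metrics. The sketch below shows how such frame-level metrics could be computed from regressed quality scores. It is an illustration only, not the authors' evaluation code: the 0.5 cutoff, the choice of "adequate quality" as the positive class, and the function names `threshold` and `binary_metrics` are all assumptions made for this example.

```python
from typing import Dict, List, Sequence


def threshold(scores: Sequence[float], cutoff: float = 0.5) -> List[int]:
    """Turn regressed quality scores into binary labels (1 = adequate).

    The 0.5 cutoff is an assumption for illustration only.
    """
    return [1 if s >= cutoff else 0 for s in scores]


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute accuracy, sensitivity, and specificity for binary labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    # Count the four confusion-matrix cells.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    return {
        "accuracy": (tp + tn) / len(y_true),
        # Sensitivity: fraction of truly positive frames that were detected.
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        # Specificity: fraction of truly negative frames that were rejected.
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }


if __name__ == "__main__":
    # Toy labels and scores, invented for this example.
    labels = [1, 1, 1, 0, 0, 0, 0, 0]
    scores = [0.9, 0.8, 0.3, 0.1, 0.2, 0.6, 0.4, 0.05]
    print(binary_metrics(labels, threshold(scores)))
```

Reporting sensitivity and specificity alongside accuracy matters when the two classes are imbalanced, since accuracy alone can hide poor performance on the rarer class.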

Pedestrian behavior recognition in the driving environment is an important technology for preventing pedestrian accidents by predicting a pedestrian's next movement. Predicting future pedestrian behavior requires recognizing current pedestrian behavior. However, many studies have recognized visible human characteristics such as the face, body parts, or clothes, while few have recognized pedestrian behavior. Recognizing pedestrian behavior in the driving environment is challenging because outdoor illumination conditions vary and vehicle movement changes the camera field of view. In this paper, to predict pedestrian behavior, we introduce a position-information-added two-stream convolutional neural network (CNN) with multitask learning that is robust to the constraints of the outdoor driving environment. The conventional two-stream CNN is the most widely used model for human-action recognition. However, the conventional two-stream CNN based on optical flow has limitations for pedestrian behavior recognition in a moving vehicle because optical flow relies on the assumptions of brightness constancy and piecewise smoothness. To address this problem, we use the binary descriptor dense scale-invariant feature transform (SIFT) flow, a feature-based matching algorithm that is robust for recognizing pedestrian actions, such as walking and standing, from a moving vehicle. However, recognizing cross attributes, such as crossing or not crossing the street, is challenging with the binary descriptor dense SIFT flow alone, because pedestrians perform the same walking action whether or not they cross the road; only their location in the image differs. Therefore, pedestrian position information should be added to the conventional binary descriptor dense SIFT flow two-stream CNN. As a result, learning that would otherwise be biased toward action attributes is balanced across action and cross attributes.
In addition, YOLO detection and a Siamese tracker are used instead of the ground-truth bounding box to demonstrate robustness in action- and cross-attribute recognition from a moving vehicle. The JAAD and PIE datasets were used for training, and only the JAAD dataset was used for testing, for comparison with other state-of-the-art research on multitask and single-task learning.
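The idea of adding position information to a two-stream model and training it on two attributes at once can be sketched in a few lines. The example below is a rough illustration under stated assumptions, not the architecture from the abstract: it assumes position is encoded as a normalized bounding-box center and size, that fusion is a plain concatenation of the appearance, motion, and position feature vectors, and that the multitask loss is a weighted sum of two cross-entropy terms. All function names and the `cross_weight` parameter are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def position_features(box: Tuple[float, float, float, float],
                      image_size: Tuple[int, int]) -> List[float]:
    """Encode a bounding box (x1, y1, x2, y2) as normalized center and size.

    Assumed encoding for illustration; the abstract does not specify one.
    """
    x1, y1, x2, y2 = box
    width, height = image_size
    return [
        ((x1 + x2) / 2) / width,   # normalized center x
        ((y1 + y2) / 2) / height,  # normalized center y
        (x2 - x1) / width,         # normalized box width
        (y2 - y1) / height,        # normalized box height
    ]


def fuse(appearance: Sequence[float], motion: Sequence[float],
         position: Sequence[float]) -> List[float]:
    """Concatenate the two stream features with the position features."""
    return list(appearance) + list(motion) + list(position)


def cross_entropy(logits: Sequence[float], target: int) -> float:
    """Softmax cross-entropy for one sample, computed stably."""
    peak = max(logits)
    log_sum = peak + math.log(sum(math.exp(v - peak) for v in logits))
    return log_sum - logits[target]


def multitask_loss(action_logits: Sequence[float], action_target: int,
                   cross_logits: Sequence[float], cross_target: int,
                   cross_weight: float = 1.0) -> float:
    """Weighted sum of the action-attribute and cross-attribute losses."""
    return (cross_entropy(action_logits, action_target)
            + cross_weight * cross_entropy(cross_logits, cross_target))


if __name__ == "__main__":
    # Toy values, invented for this example.
    pos = position_features((100, 200, 300, 600), (1000, 800))
    fused = fuse([0.1, 0.2], [0.3], pos)
    print(len(fused), pos)
    print(multitask_loss([2.0, 0.5], 0, [0.1, 1.2], 1))
```

Because two pedestrians performing the same walking action produce similar motion features, the position vector is the part of the fused input that can separate crossing from not crossing in this sketch.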

Articles published on Two-stream Convolutional Neural Network

Construction and Application of Quality Assessment Model of No-reference Images Two-Stream Convolutional Neural Network

DBCAN: DFormer-Based Cross-Attention Network for RGB Depth Semantic Segmentation

Research on Small Sample Rolling Bearing Fault Diagnosis Method Based on Mixed Signal Processing Technology

Two-Stream Convolutional Neural Networks for Breathing Pattern Classification: Real-Time Monitoring of Respiratory Disease Patients.

Constructing two-stream input matrices in a convolutional neural network for photovoltaic power prediction

Spatial-temporal multiscale feature optimization based two-stream convolutional neural network for action recognition

AF-FTTSnet: An end-to-end two-stream convolutional neural network for online quality monitoring of robotic welding

Spatiotemporal Deep Learning-Based Cine Loop Quality Filter for Handheld Point-of-Care Echocardiography.

Human Action Recognition using Methods Deep Learning

Target Detection Based on Two-Stream Convolution Neural Network With Self-Powered Sensors Information

Detection of background forgery using a two-stream convolutional neural network architecture

Retracted: A Novel English Translation Model in Complex Environments Using Two-Stream Convolutional Neural Networks.

Facial Expression Identification using Two-Stream Convolutional Neural Networks (TSCNNs) and Inception 3D Convolutional Neural Network (CNN)

Automated major depressive disorder diagnosis using a dual-input deep learning model and image generation from EEG signals

Direction-guided two-stream convolutional neural networks for skeleton-based action recognition

Convolutional Transformer Fusion Blocks for Multi-Modal Gesture Recognition

Contrastive Accelerometer–Gyroscope Embedding Model for Human Activity Recognition

Learning Spatiotemporal Graph Representations for Visual Perception Using EEG Signals.

Binary Dense SIFT Flow Based Position-Information Added Two-Stream CNN for Pedestrian Action Recognition

Bolt-Loosening Detection Using 1D and 2D Input Data Based on Two-Stream Convolutional Neural Networks.
