Features Of Video Frames Research Articles

This paper treats each pixel of a video-camera image as a sensor that carries structural dynamic information. The resulting non-contact virtual sensor array is decomposed spatiotemporally, in the same way as data from traditional accelerometers, to extract structural modal frequencies. An attention-based deep neural network architecture is proposed to better visualize the dynamic properties of structures at high resolution in the presence of noise. The work combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to predict the modal frequencies of structures from a series of consecutive images. Highly discriminative features of video frames are first extracted using the CNN, and a convolutional long short-term memory (ConvLSTM) network then processes the extracted features to capture the temporal dynamics of the video. Attention mechanisms are embedded in the network so that the model learns to focus selectively on the frames that contain system dynamics. The proposed computer vision-based deep learning model takes the video of a vibrating structure as input and successfully estimates its modal frequencies. Transfer learning is applied to carry the knowledge learned from publicly available datasets over to a much more sophisticated structure and estimate its resonant frequencies. The proposed algorithm optimizes the filter design for video processing in a fully automated way, without human intervention, and can generalize and transfer the learned information to more complex structures. The model is trained on publicly available generic baseline data (Dataset A), consisting of several simple beam structures with different material properties and sizes, and the learned knowledge is transferred to unseen data (Dataset B), consisting of an independent turbine blade.
It is concluded that the proposed method is more autonomous and accurate, and that it generalizes to a new independent dataset through a transfer learning strategy. The main advantage of the approach is that the trained deep learning architecture can estimate the resonant frequencies of independent structures and extend resonant frequency estimation to higher modes.
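
The idea of reading a pixel like an accelerometer can be illustrated without any neural network. The sketch below is not the paper's attention-based ConvLSTM model; it is a plain discrete Fourier transform peak search on one pixel's brightness over time, shown only to make the "virtual sensor" idea concrete. The function name `dominant_frequency`, the 240 fps frame rate, and the synthetic 12 Hz signal are all assumptions made for illustration.

```python
import cmath
import math


def dominant_frequency(signal, fps):
    """Return the frequency (Hz) of the largest non-DC peak in the DFT of a real signal."""
    n = len(signal)
    mean = sum(signal) / n
    centered = [s - mean for s in signal]  # remove the static brightness (DC offset)
    best_bin, best_mag = 0, 0.0
    for k in range(1, n // 2 + 1):  # positive-frequency bins only
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(centered)
        )
        if abs(coeff) > best_mag:
            best_bin, best_mag = k, abs(coeff)
    return best_bin * fps / n  # convert bin index to Hz


# Hypothetical data: brightness of one pixel in a 240 fps video of a
# structure vibrating at 12 Hz (synthetic, not taken from the paper).
fps = 240
n_frames = 480
pixel = [128 + 10 * math.sin(2 * math.pi * 12 * t / fps) for t in range(n_frames)]

print(dominant_frequency(pixel, fps))  # 12.0
```

With 480 frames at 240 fps the frequency resolution is 0.5 Hz, so a longer recording is needed to separate closely spaced modes. A direct DFT is quadratic in the number of frames; a fast Fourier transform would be the usual choice for real video lengths.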

This paper provides an in-depth discussion of human motion recognition in virtual reality (VR) video sequences using hidden Markov models. The method has four steps: VR video acquisition and pre-processing, foreground detection, extraction of human feature parameters, and hidden Markov model human motion recognition. A Gaussian mixture model was used to build a background model in real time from changes in the VR video, and the background was removed from each image by background subtraction. The optical flow method was used for foreground detection of the target, and sparse and dense optical flow were compared to obtain the motion characteristics and optical flow information of the target human body. Features of human motion were extracted from two sources: common geometric features of the body and optical flow information. The geometric features were the width-to-height ratio, perimeter-to-area ratio, center of mass, eccentricity, and feature angle. For the optical flow information, optical flow descriptors were constructed using a grid-based approach. The two kinds of features were fused by the k-means method to construct a bag-of-words model. The hidden Markov model parameters were obtained by training on the human feature parameters for each of the four motions, and the four common human body movements were recognized with the forward-backward algorithm. The test results show that the motion recognition method in this paper has high recognition performance and good anti-interference performance.
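
The recognition step can be sketched as follows, assuming each frame has already been quantized to a k-means codeword index so that a video becomes a sequence of integers. One discrete hidden Markov model is kept per motion, and a sequence is assigned to the model that gives it the highest likelihood. Only the forward pass is needed to score a sequence; the backward pass matters for training. The model parameters, the motion names `walk` and `jump`, and the function names are hypothetical and are not taken from the paper.

```python
import math


def forward_log_likelihood(obs, start, trans, emit):
    """Log P(obs | model) for a discrete HMM, using the scaled forward recursion."""
    n_states = len(start)
    alpha = [start[i] * emit[i][obs[0]] for i in range(n_states)]
    log_lik = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate one step through the transition matrix, then weight
            # each state by how likely it is to emit the observed codeword.
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n_states)) * emit[j][obs[t]]
                for j in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
        log_lik += math.log(scale)
    return log_lik


def classify(obs, models):
    """Return the name of the model that assigns obs the highest likelihood."""
    return max(models, key=lambda name: forward_log_likelihood(obs, *models[name]))


# Hypothetical models: (start probabilities, transition matrix, emission matrix)
# with 2 hidden states and 3 codewords each.
models = {
    "walk": ([0.6, 0.4], [[0.7, 0.3], [0.4, 0.6]], [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1]]),
    "jump": ([0.5, 0.5], [[0.5, 0.5], [0.5, 0.5]], [[0.1, 0.1, 0.8], [0.2, 0.2, 0.6]]),
}

print(classify([0, 1, 0, 0, 1], models))  # walk
print(classify([2, 2, 2, 1, 2], models))  # jump
```

Working in log space with per-step rescaling keeps the scores comparable for long videos, where raw probabilities would underflow.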

Time-sequence pooling is used to sort the effective video frame feature sequences and obtain feature vectors that represent the dynamic changes of the video over time; finally, these time-sequence feature vectors are used to train a support vector machine for classification. The recognition accuracy is 65.2% on the HMDB51 dataset and 89.4% on the UCF101 dataset.

Articles published on Features Of Video Frames

Automated detection of steps in videos of strabismus surgery using deep learning

Cross-stage recurrent feature sharing network for video dehazing

Facial expression recognition based on adaptation of the classifier to videos of the user

Gaze prediction based on long short-term memory convolution with associated features of video frames

Sequential Order-Aware Coding-Based Robust Subspace Clustering for Human Action Recognition in Untrimmed Videos

A hybrid-attention-ConvLSTM-based deep learning architecture to extract modal frequencies from limited data using transfer learning

A multi-flexible video summarization scheme using property-constraint decision tree

Region Dual Attention-Based Video Emotion Recognition

Detecting abnormality with separated foreground and background: Mutual Generative Adversarial Networks for video abnormal event detection

Zero-watermarking Algorithm for Audio and Video Matching Verification

Key Clips and Key Frames Extraction of Videos Based on Deep Learning

Video emotion recognition based on Convolutional Neural Networks

An Efficient CNN and BI-LSTM Model for Abnormal Event Detection in Video Surveillance

Translating video into language by enhancing visual and language representations

Abnormal Event Detection via Feature Expectation Subgraph Calibrating Classification in Video Surveillance Scenes

Key Algorithm for Human Motion Recognition in Virtual Reality Video Sequences Based on Hidden Markov Model

Joint Dynamic Pose Image and Space Time Reversal for Human Action Recognition from Videos

Spatial-temporal pyramid based Convolutional Neural Network for action recognition

An attention mechanism based convolutional LSTM network for video action recognition

Event detection in soccer videos using unsupervised learning of Spatio-temporal features based on pooled spatial pyramid model
