Extract Spatiotemporal Features Research Articles

EEG-based emotion recognition has gradually become a new research direction, known as affective Brain-Computer Interface (aBCI), which has huge application potential in human-computer interaction and neuroscience. However, how to extract spatio-temporal fusion features from complex EEG signals and build learning method with high recognition accuracy and strong interpretability is still challenging. In this paper, we propose a hybrid attention spatio-temporal feature fusion network for EEG-based emotion recognition. First, we designed a spatial attention feature extractor capable of merging shallow and deep features to extract spatial information and adaptively select crucial features under different emotional states. Then, the temporal feature extractor based on the multi-head attention mechanism is integrated to perform spatio-temporal feature fusion to achieve emotion recognition. Finally, we visualize the extracted spatial attention features using feature maps, further analyzing key channels corresponding to different emotions and subjects. Our method outperforms the current state-of-the-art methods on two public datasets, SEED and DEAP. The recognition accuracy are 99.12% ± 1.25% (SEED), 98.93% ± 1.45% (DEAP-arousal), and 98.57% ± 2.60% (DEAP-valence). We also conduct ablation experiments, using statistical methods to analyze the impact of each module on the final result. The spatial attention features reveal that emotion-related neural patterns indeed exist, which is consistent with conclusions in the field of neurology. The experimental results show that our method can effectively extract and fuse spatial and temporal information. It has excellent recognition performance, and also possesses strong robustness, performing stably across different datasets and experimental environments for emotion recognition.

Read full abstract

Nowadays, Transformer-based visual tracking algorithms have been developing quickly because of the self-attention mechanism of Transformer, which has the capability to model global information. Although the self-attention mechanism in Transformer can effectively capture long-range dependencies in feature space, they only use flattened two-dimensional features and are unable to capture long-range temporal dependencies. Furthermore, since the self-attention in Transformer functions as a low-pass filter, it picks up on low-frequency features of the target while ignoring high-frequency features. This research suggests a Transformer tracker based on action information and mix-frequency features (AMTrack) to address these problems. Specifically, to address the lack of temporal remote dependencies, we introduce the target action aware module and the target action offset module. The target action aware module sets up several pathways to extract spatio-temporal, channel, and motion feature independently. In contrast, the target action offset module computes the target’s offset information by computing relative feature maps. Furthermore, in order to address the imbalance between high and low frequency features, we propose a mix-frequency attention and multi-frequency self-attention convolutional block. The mix-frequency attention uses high-frequency features within partitioned local windows as input for the high-frequency branch and average-pooled low-frequency features as the input for the low-frequency branch, calculating attention scores in both branches respectively. The multi-frequency self-attention convolutional block uses self-attention to capture low-frequency features and convolution to capture high-frequency features. Extensive experiments are carried out on eight challenging tracking datasets (e.g., OTB100 (Object Tracking Benchmark 100), NFS (Need For Speed), UAV123 (Unmanned Aerial Vehicles 123), TC128 (Temple Color 128), VOT2018 (Visual Object Tracking 2018), LaSOT (Large-scale Single Object Tracking), TrackingNet (Tracking Network), GOT-10k (Generic Object Tracking-10k)), and the experimental results show that our tracker achieves excellent tracking performance when compared with several state-of-the-art tracking algorithms. The experimental results show that on LaSOT, the success rates AUC (Area Under Curve), PNorm, and P reach 65.8%, 69.2%, and 68.0%, respectively, where the AUC value is 2.1% higher than the baseline algorithm TrDiMP (Transformer Discriminative Model Prediction). On other datasets, our tracker also achieves excellent tracking performance.

Read full abstract

Extract Spatiotemporal Features Research Articles

Related Topics

Articles published on Extract Spatiotemporal Features

Multifeature extraction based MobileViTv3 model for fish feeding behavior recognition from video streaming

Transformer-based multiview spatiotemporal feature interactive fusion for human action recognition in depth videos

Early stroke behavior detection based on improved video masked autoencoders for potential patients

Fall detection method based on spatio-temporal coordinate attention for high-resolution networks

HASTF: a hybrid attention spatio-temporal feature fusion network for EEG emotion recognition.

An Ultra‐Short‐Term Wind Power Prediction Method Based on Spatiotemporal Characteristics Fusion

Identification of plagioclase extinction-angle features from polarized images using deep neural network.

Gear Classification in Skating Cross-Country Skiing Using Inertial Sensors and Deep Learning.

Emotion Recognition Using EEG Signals and Audiovisual Features with Contrastive Learning.

AONet: Attention network with optional activation for unsupervised video anomaly detection

AMTrack:Transformer tracking via action information and mix-frequency features

Trajectory Privacy-Protection Mechanism Based on Multidimensional Spatial–Temporal Prediction

Optimized deep learning modelling for predicting the diffusion range and state change of filling projects

Research on Surgical Gesture Recognition in Open Surgery Based on Fusion of R3D and Multi-Head Attention Mechanism

An evolutionary deep learning model based on XGBoost feature selection and Gaussian data augmentation for AQI prediction

PI-STGnet: Physics-integrated spatiotemporal graph neural network with fundamental diagram learner for highway traffic flow prediction

PulseNet: Multi-task learning-based non-contact pulse condition diagnosis using multi-scale fusion and transformer

Single-Trial Detection and Classification of Event-Related Optical Signals for a Brain-Computer Interface Application.

Research on investment project risk prediction and management based on machine learning

Predicting spatio-temporal traffic flow: a comprehensive end-to-end approach from surveillance cameras

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Extract Spatiotemporal Features Research Articles

Related Topics

Articles published on Extract Spatiotemporal Features

Multifeature extraction based MobileViTv3 model for fish feeding behavior recognition from video streaming

Transformer-based multiview spatiotemporal feature interactive fusion for human action recognition in depth videos

Early stroke behavior detection based on improved video masked autoencoders for potential patients

Fall detection method based on spatio-temporal coordinate attention for high-resolution networks

HASTF: a hybrid attention spatio-temporal feature fusion network for EEG emotion recognition.

An Ultra‐Short‐Term Wind Power Prediction Method Based on Spatiotemporal Characteristics Fusion

Identification of plagioclase extinction-angle features from polarized images using deep neural network.

Gear Classification in Skating Cross-Country Skiing Using Inertial Sensors and Deep Learning.

Emotion Recognition Using EEG Signals and Audiovisual Features with Contrastive Learning.

AONet: Attention network with optional activation for unsupervised video anomaly detection

AMTrack:Transformer tracking via action information and mix-frequency features

Trajectory Privacy-Protection Mechanism Based on Multidimensional Spatial–Temporal Prediction

Optimized deep learning modelling for predicting the diffusion range and state change of filling projects

Research on Surgical Gesture Recognition in Open Surgery Based on Fusion of R3D and Multi-Head Attention Mechanism

An evolutionary deep learning model based on XGBoost feature selection and Gaussian data augmentation for AQI prediction

PI-STGnet: Physics-integrated spatiotemporal graph neural network with fundamental diagram learner for highway traffic flow prediction

PulseNet: Multi-task learning-based non-contact pulse condition diagnosis using multi-scale fusion and transformer

Single-Trial Detection and Classification of Event-Related Optical Signals for a Brain-Computer Interface Application.

Research on investment project risk prediction and management based on machine learning

Predicting spatio-temporal traffic flow: a comprehensive end-to-end approach from surveillance cameras