Recognition Benchmark Research Articles

Stress is a prevalent issue in modern society which affects a large percentage of the population. Stress has negative effects on human daily lives such as reduced memory and concentration levels, forgetfulness, restlessness, and weakness. Although it is challenging to extinguish stress from human lives, monitoring and controlling its consequences can be implemented. Current methods for stress monitoring often lack interpretability of results and making it difficult to provide an interpretable system for immediate mental health counseling and intervention. Wearable devices are of growing interest due to the impact of the COVID-19 pandemic on mental health, as wearables provide monitoring of physiological parameters which helps in measuring stress levels. Here, an investigation on the feasibility of using wrist and chest-based multimodal physiological signals collected from wearables is executed for stress classification and identification of underlying modalities that impact the stress. Multimodalities from wrist and chest sensors are used to examine the three-class classification, baseline, stress and amusement. For improved and visualized effects, the multimodal signals from wearables were transformed into images. Then, a hybrid model was introduced, which comprised of attentive convolution neural network (CNN), transposed attentive CNN connections with long-short term memory (LSTM) and followed by attention layer for efficient feature extraction and improved classification accuracy. Results found that the proposed model with images provided 97 % accuracy and 96 % F1-score for chest wearable and 94 % accuracy and 93 % F1-score for wrist wearable data. Additionally, the post-hoc explainability approach (Local interpretable model agnostic applications, LIME) provided visual representations of contributing features from each signal for each class. LIME shows electrodermal activity (EDA), temperature (TEMP) and respiration (RESP) to be the significant factor in stress recognition from chest wearable and blood volume pulse (BVP) and EDA from wrist wearable. Additionally, increasing the number of features of the explainable model influences the modalities influence on the model explainability. The results establish a benchmark for explainable stress recognition using different sensor data.

Read full abstract

This research explores the complex yet applicable domain of universal visual recognition, where unlabeled datasets may encompass additional, as-of-yet unidentified classes and present a skewed feature distribution within labeled datasets. The primary challenge lies in concurrently distinguishing instances of known classes and uncovering new classes within the unlabeled data. This difficulty is compounded by two factors: category shift and distribution shift. Current methodologies typically overlook the potential of harnessing the inherent structural relationship between labeled and unlabeled datasets, and they fail to effectively isolate the semantic variability of the newly identified classes from the established ones. To overcome these hurdles, we introduce an innovative framework, which we have termed MESA (short for Manifold structure Exploitation with double Spaces dual Alignment), to advance the field of universal visual recognition. Our approach begins by identifying geometrically and semantically related pairs of nearest neighbors between labeled and unlabeled data, which serve as anchor points to foster an intrinsic match between the two datasets. By integrating an affinity score for similarity, we establish a novel cross-view, anchor-guided, self-supervised learning paradigm. This model is designed to spread the learned labels’ information and to conjoin novel semantic insights within the predictive space. To intensify the matching accuracy for the known classes and enhance the differentiation of novel classes, we further implement a confident-prototype-driven, self-supervised learning paradigm in a cross-view manner. This strategy is aimed at revealing the macroscopic category configuration within the embedding space. MESA employs a pioneering bidirectional dual-alignment mechanism that operates simultaneously in the embedding and prediction spaces, hence providing a more robust approach to confronting challenges brought about by distribution and category shifts. Our exhaustive evaluation across several extensive image recognition benchmarks substantiates MESA’s superior performance over the latest cutting-edge methods, marking a significant leap forward in the sphere of universal visual recognition. Finally, MESA is implemented in Python using the Pytorch machine-learning library, and it is freely available at https://github.com/aimeeyaoyao/MESA.

Read full abstract

Recognition Benchmark Research Articles

Related Topics

Articles published on Recognition Benchmark

A large corpus for the recognition of Greek Sign Language gestures

XFMP: A Benchmark for Explainable Fine-Grained Abnormal Behavior Recognition on Medical Personal Protective Equipment

Cross-view action recognition understanding from exocentric to egocentric perspective

Dynamic Routing and Knowledge Re-Learning for Data-Free Black-Box Attack.

InfoGCN++: Learning Representation by Predicting the Future for Online Skeleton-based Action Recognition.

HunFlair2 in a cross-corpus evaluation of biomedical named entity recognition and normalization tools.

MAFormer: A cross-channel spatio-temporal feature aggregation method for human action recognition

BViT: Broad Attention-Based Vision Transformer.

A hybrid transposed attention based deep learning model for wearable and explainable stress recognition

EMS: A Large-Scale Eye Movement Dataset, Benchmark, and New Model for Schizophrenia Recognition.

SiSe: Simultaneous and Sequential Transformers for multi-label activity recognition

Self-learning activation functions to increase accuracy of privacy-preserving Convolutional Neural Networks with homomorphic encryption.

Binary Encoding-Based Federated Learning for Traffic Sign Recognition in Autonomous Driving

TRAFFIC SIGN RECOGNITION IN CHALLENGING WEATHER CONDITIONS USING CONVOLUTIONAL NEURAL NETWORKS

Perturbation Augmentation for Adversarial Training with Diverse Attacks

Efficient pyramid channel attention network for pathological myopia recognition with pretraining-and-finetuning

An efficient motion visual learning method for video action recognition

Intrinsic structure exploitation with dual alignment for universal visual recognition

A lightweight network architecture for traffic sign recognition based on enhanced LeNet-5 network.

Depression risk recognition based on gait: A benchmark

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Recognition Benchmark Research Articles

Related Topics

Articles published on Recognition Benchmark

A large corpus for the recognition of Greek Sign Language gestures

XFMP: A Benchmark for Explainable Fine-Grained Abnormal Behavior Recognition on Medical Personal Protective Equipment

Cross-view action recognition understanding from exocentric to egocentric perspective

Dynamic Routing and Knowledge Re-Learning for Data-Free Black-Box Attack.

InfoGCN++: Learning Representation by Predicting the Future for Online Skeleton-based Action Recognition.

HunFlair2 in a cross-corpus evaluation of biomedical named entity recognition and normalization tools.

MAFormer: A cross-channel spatio-temporal feature aggregation method for human action recognition

BViT: Broad Attention-Based Vision Transformer.

A hybrid transposed attention based deep learning model for wearable and explainable stress recognition

EMS: A Large-Scale Eye Movement Dataset, Benchmark, and New Model for Schizophrenia Recognition.

SiSe: Simultaneous and Sequential Transformers for multi-label activity recognition

Self-learning activation functions to increase accuracy of privacy-preserving Convolutional Neural Networks with homomorphic encryption.

Binary Encoding-Based Federated Learning for Traffic Sign Recognition in Autonomous Driving

TRAFFIC SIGN RECOGNITION IN CHALLENGING WEATHER CONDITIONS USING CONVOLUTIONAL NEURAL NETWORKS

Perturbation Augmentation for Adversarial Training with Diverse Attacks

Efficient pyramid channel attention network for pathological myopia recognition with pretraining-and-finetuning

An efficient motion visual learning method for video action recognition

Intrinsic structure exploitation with dual alignment for universal visual recognition

A lightweight network architecture for traffic sign recognition based on enhanced LeNet-5 network.

Depression risk recognition based on gait: A benchmark