Improve Recognition Performance Research Articles

Surgical workflow analysis is essential to help optimize surgery by encouraging efficient communication and the use of resources. However, the performance of phase recognition is limited by the use of information related to the presence of surgical instruments. To address the problem, we propose visual modality-based multimodal fusion for surgical phase recognition to overcome the limited diversity of information such as the presence of instruments. Using the proposed methods, we extracted a visual kinematics-based index related to using instruments, such as movement and their interrelations during surgery. In addition, we improved recognition performance using an effective convolutional neural network (CNN)-based fusion method for visual features and a visual kinematics-based index (VKI). The visual kinematics-based index improves the understanding of a surgical procedure since information is related to instrument interaction. Furthermore, these indices can be extracted in any environment, such as laparoscopic surgery, and help obtain complementary information for system kinematics log errors. The proposed methodology was applied to two multimodal datasets, a virtual reality (VR) simulator-based dataset (PETRAW) and a private distal gastrectomy surgery dataset, to verify that it can help improve recognition performance in clinical environments. We also explored the influence of a visual kinematics-based index to recognize each surgical workflow by the instrument’s existence and the instrument’s trajectory. Through the experimental results of a distal gastrectomy video dataset, we validated the effectiveness of our proposed fusion approach in surgical phase recognition. The relatively simple yet index-incorporated fusion we propose can yield significant performance improvements over only CNN-based training and exhibits effective training results compared to fusion based on Transformers, which require a large amount of pre-trained data.

Read full abstract

Previous studies have demonstrated menstrual cycle dependent changes in the recognition of facial emotional expressions, specifically the expression of fear, anger, sadness or disgust. While some studies demonstrate an improvement of emotion recognition performance during the peri-ovulatory phase, when estradiol levels peak, other studies demonstrate a deterioration of emotion recognition performance during the mid-luteal phase, when progesterone levels peak. It has been hypothesized, that these changes in emotion recognition performance mirror mood changes along the menstrual cycle. In the present study, we investigate, whether changes in emotion recognition performance along the menstrual cycle are mediated by mood changes along the menstrual cycle. In a combined cross-sectional and longitudinal study design, two large samples of women completed an emotion recognition task, as well as several mood questionnaires during their menses, peri-ovulatory or mid-luteal cycle phase. 65 women completed the task thrice, once during each cycle phase, order counterbalanced. In order to control for potential learning effects, a sample of 153 women completed the task only once in one of the three cycle phases. In both samples, results demonstrated no significant changes in emotion recognition performance along the menstrual cycle, irrespective of the performance measure investigated (accuracy, reaction time, frequency of emotion classifications) and irrespective of the emotion displayed. Bayesian statistics provided very strong evidence for the null hypothesis, that emotion recognition does not change along the menstrual cycle. There was also no moderation of emotion recognition changes along the menstrual cycle by mood changes along the menstrual cycle. Mood changes along the menstrual cycle followed the expected pattern with highest positive affect and least premenstrual symptoms around ovulation and lowest positive affect, but strongest premenstrual symptoms during menses. Interestingly, premenstrual symptoms were negatively related to estradiol, suggesting a protective effect of estrogen during the luteal cycle phase against mood worsening during the premenstrual phase.

Read full abstract

Improve Recognition Performance Research Articles

Related Topics

Articles published on Improve Recognition Performance

Multimodal Sparse Transformer Network for Audio-Visual Speech Recognition.

SAR-ATR with knowledge hierarchy division and information dissemination networks

ReFuSeAct: Representation fusion using self-supervised learning for activity recognition in next generation networks

Hybrid Deep Neural Network Framework Combining Skeleton and Gait Features for Pathological Gait Recognition.

Toward enriched decoding of mandarin spontaneous speech

Visual modalities-based multimodal fusion for surgical phase recognition

Does Tactile Stimulation of the Face Affect the Processing of Other Faces? Neural and Behavioural Effects of Facial Touch

Research on Weibo New Word Recognition based on Weibo Data and Statistical Information

Intelligent vision for the detection of chemistry glassware toward AI robotic chemists

Handwritten Javanese script recognition method based 12-layers deep convolutional neural network and data augmentation

CRABR-Net: A Contextual Relational Attention-Based Recognition Network for Remote Sensing Scene Objective.

Cross-modal alignment and translation for missing modality action recognition

Deep cross-domain transfer for emotion recognition via joint learning

Optimized Classifier Learning for Face Recognition Performance Boost in Security and Surveillance Applications.

Person Re-Identification Method Based on Dual Descriptor Feature Enhancement.

Emotion recognition and mood along the menstrual cycle

Few-shot person re-identification based on Feature Set Augmentation and Metric Fusion

Mean Teacher-Based Cross-Domain Activity Recognition Using WiFi Signals

Space-and-speaker-aware acoustic modeling with effective data augmentation for recognition of multi-array conversational speech

Contrastive self-supervised representation learning without negative samples for multimodal human action recognition.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Improve Recognition Performance Research Articles

Related Topics

Articles published on Improve Recognition Performance

Multimodal Sparse Transformer Network for Audio-Visual Speech Recognition.

SAR-ATR with knowledge hierarchy division and information dissemination networks

ReFuSeAct: Representation fusion using self-supervised learning for activity recognition in next generation networks

Hybrid Deep Neural Network Framework Combining Skeleton and Gait Features for Pathological Gait Recognition.

Toward enriched decoding of mandarin spontaneous speech

Visual modalities-based multimodal fusion for surgical phase recognition

Does Tactile Stimulation of the Face Affect the Processing of Other Faces? Neural and Behavioural Effects of Facial Touch

Research on Weibo New Word Recognition based on Weibo Data and Statistical Information

Intelligent vision for the detection of chemistry glassware toward AI robotic chemists

Handwritten Javanese script recognition method based 12-layers deep convolutional neural network and data augmentation

CRABR-Net: A Contextual Relational Attention-Based Recognition Network for Remote Sensing Scene Objective.

Cross-modal alignment and translation for missing modality action recognition

Deep cross-domain transfer for emotion recognition via joint learning

Optimized Classifier Learning for Face Recognition Performance Boost in Security and Surveillance Applications.

Person Re-Identification Method Based on Dual Descriptor Feature Enhancement.

Emotion recognition and mood along the menstrual cycle

Few-shot person re-identification based on Feature Set Augmentation and Metric Fusion

Mean Teacher-Based Cross-Domain Activity Recognition Using WiFi Signals

Space-and-speaker-aware acoustic modeling with effective data augmentation for recognition of multi-array conversational speech

Contrastive self-supervised representation learning without negative samples for multimodal human action recognition.