Rectification Network Research Articles

Objective. The decline in the performance of electromyography (EMG)-based silent speech recognition is widely attributed to disparities in speech patterns, articulation habits, and individual physiology among speakers. Feature alignment by learning a discriminative network that resolves domain offsets across speakers is an effective method to address this problem. The prevailing adversarial network with a branching discriminator specializing in domain discrimination renders insufficiently direct contribution to categorical predictions of the classifier.Approach. To this end, we propose a simplified discrepancy-based adversarial network with a streamlined end-to-end structure for EMG-based cross-subject silent speech recognition. Highly aligned features across subjects are obtained by introducing a Nuclear-norm Wasserstein discrepancy metric on the back end of the classification network, which could be utilized for both classification and domain discrimination. Given the low-level and implicitly noisy nature of myoelectric signals, we devise a cascaded adaptive rectification network as the front-end feature extraction network, adaptively reshaping the intermediate feature map with automatically learnable channel-wise thresholds. The resulting features effectively filter out domain-specific information between subjects while retaining domain-invariant features critical for cross-subject recognition.Main results. A series of sentence-level classification experiments with 100 Chinese sentences demonstrate the efficacy of our method, achieving an average accuracy of 89.46% tested on 40 new subjects by training with data from 60 subjects. Especially, our method achieves a remarkable 10.07% improvement compared to the state-of-the-art model when tested on 10 new subjects with 20 subjects employed for training, surpassing its result even with three times training subjects.Significance. Our study demonstrates an improved classification performance of the proposed adversarial architecture using cross-subject myoelectric signals, providing a promising prospect for EMG-based speech interactive application.

Read full abstract

The exploration of linguistic information promotes the development of scene text recognition task. Benefiting from the significance in parallel reasoning and global relationship capture, transformer-based language model (TLM) has achieved dominant performance recently. As a decoupled structure from the recognition process, we argue that TLM's capability is limited by the input low-quality visual prediction. To be specific: 1) The visual prediction with low character-wise accuracy increases the correction burden of TLM. 2) The inconsistent word length between visual prediction and original image provides a wrong language modeling guidance in TLM. In this paper, we propose a Progressive scEne Text Recognizer (PETR) to improve the capability of transformer-based language model by handling above two problems. Firstly, a Destruction Learning Module (DLM) is proposed to consider the linguistic information in the visual context. DLM introduces the recognition of destructed images with disordered patches in the training stage. Through guiding the vision model to restore patch orders and make word-level prediction on the destructed images, visual prediction with high character-wise accuracy is obtained by exploring inner relationship between the local visual patches. Secondly, a new Language Rectification Module (LRM) is proposed to optimize the word length for language guidance rectification. Through progressively implementing LRM in different language modeling steps, a novel progressive rectification network is constructed to handle some extremely challenging cases (e.g. distortion, occlusion, etc.). By utilizing DLM and LRM, PETR enhances the capability of transformer-based language model from a more general aspect, that is, focusing on the reduction of correction burden and rectification of language modeling guidance. Compared with parallel transformer-based methods, PETR obtains 1.0% and 0.8% improvement on regular and irregular datasets respectively while introducing only 1.7M additional parameters. The extensive experiments on both English and Chinese benchmarks demonstrate that PETR achieves the state-of-the-art results.

Read full abstract

Rectification Network Research Articles

Articles published on Rectification Network

A simplified adversarial architecture for cross-subject silent speech recognition using electromyography

Coronary artery segmentation in CCTA images based on multi-scale feature learning.

Depthwise Convolution for Multi-Agent Communication With Enhanced Mean-Field Approximation.

AIR-CNN: a lightweight automatic image rectification CNN used for barrel distortion

Level set guided region prototype rectification network for retinal vessel segmentation

A dual-band, wide-angle absorbing metasurface for EM energy harvesting and wireless power transfer

Image projective transformation rectification with synthetic data for smartphone-captured chest X-ray photos classification

FishRecGAN: An End to End GAN Based Network for Fisheye Rectification and Calibration

A Two-Level Rectification Attention Network for Scene Text Recognition

Revisiting Radial Distortion Rectification in Polar-Coordinates: A New and Efficient Learning Perspective

An attention-based network for serial number recognition on banknotes

Adaptive rectification based adversarial network with spectrum constraint for high-quality PET image synthesis.

Robust Correlation Tracking in Unmanned Aerial Vehicle Videos via Deep Target-Specific Rectification Networks

PETR: Rethinking the Capability of Transformer-Based Language Model in Scene Text Recognition.

Adversarial learning based attentional scene text recognizer

Progressive rectification network for irregular text recognition

Learning Rich Part Hierarchies with Progressive Attention Networks for Fine-Grained Image Recognition.

MORAN: A Multi-Object Rectified Attention Network for scene text recognition

ASTER: An Attentional Scene Text Recognizer with Flexible Rectification.

The Heartmate II: design and development of a fully sealed axial flow left ventricular assist system.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Rectification Network Research Articles

Articles published on Rectification Network

A simplified adversarial architecture for cross-subject silent speech recognition using electromyography

Coronary artery segmentation in CCTA images based on multi-scale feature learning.

Depthwise Convolution for Multi-Agent Communication With Enhanced Mean-Field Approximation.

AIR-CNN: a lightweight automatic image rectification CNN used for barrel distortion

Level set guided region prototype rectification network for retinal vessel segmentation

A dual-band, wide-angle absorbing metasurface for EM energy harvesting and wireless power transfer

Image projective transformation rectification with synthetic data for smartphone-captured chest X-ray photos classification

FishRecGAN: An End to End GAN Based Network for Fisheye Rectification and Calibration

A Two-Level Rectification Attention Network for Scene Text Recognition

Revisiting Radial Distortion Rectification in Polar-Coordinates: A New and Efficient Learning Perspective

An attention-based network for serial number recognition on banknotes

Adaptive rectification based adversarial network with spectrum constraint for high-quality PET image synthesis.

Robust Correlation Tracking in Unmanned Aerial Vehicle Videos via Deep Target-Specific Rectification Networks

PETR: Rethinking the Capability of Transformer-Based Language Model in Scene Text Recognition.

Adversarial learning based attentional scene text recognizer

Progressive rectification network for irregular text recognition

Learning Rich Part Hierarchies with Progressive Attention Networks for Fine-Grained Image Recognition.

MORAN: A Multi-Object Rectified Attention Network for scene text recognition

ASTER: An Attentional Scene Text Recognizer with Flexible Rectification.

The Heartmate II: design and development of a fully sealed axial flow left ventricular assist system.