Query Vector Research Articles

When a robotic arm writes Chinese characters, the category and position of each stroke must be extracted through object detection. Detection algorithms based on predefined anchor frames have difficulty handling the variation among the many styles of Chinese character strokes. Deformable detection transformer (deformable DETR) algorithms, which do not use predefined anchor frames, sample points randomly in the deformable attention module, so some sampling points are invalid and contribute nothing to the feature update of the current reference point. This slows how quickly the vectors in the detection head learn stroke features. To address this problem, this paper proposes a new detection method for multi-style Chinese character strokes, called the simple conditional spatial query mask deformable DETR (SCSQ-MDD). First, a mask prediction layer is jointly determined by the shallow feature map of the Chinese character image and the query vector of the transformer encoder; it is used to keep the points that actually contribute and resample those that do not, addressing the randomness of the correlation calculation among the reference points. Second, separating the content query and spatial query of the transformer decoder relaxes the dependence of the prediction task on the content embedding. Finally, a detection model without predefined anchor frames is constructed based on the SCSQ-MDD. Experiments on a multi-style Chinese character stroke dataset evaluate the performance of the SCSQ-MDD. In the testing stage, the mean average precision (mAP) improves by 3.8% and the mean average recall (mAR) improves by 1.1% compared with the deformable DETR, illustrating the effectiveness of the proposed method.
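The masking idea in this abstract can be illustrated with a small sketch. It is not the SCSQ-MDD implementation: the function name `masked_aggregate`, the per-point probabilities in `mask_probs`, and the 0.5 threshold are all assumptions made for illustration, and the sketch simply drops rejected sampling points and renormalises the attention weights instead of resampling them as the abstract describes.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def masked_aggregate(values, logits, mask_probs, threshold=0.5):
    """Aggregate sampled feature vectors, ignoring points the mask rejects.

    values:     one feature vector per sampling point
    logits:     unnormalised attention score per sampling point
    mask_probs: hypothetical predicted probability that a point contributes
    """
    keep = [i for i, p in enumerate(mask_probs) if p >= threshold]
    if not keep:
        # Assumption: fall back to all points if the mask rejects everything.
        keep = list(range(len(values)))
    # Renormalise attention over the retained points only.
    weights = softmax([logits[i] for i in keep])
    dim = len(values[0])
    out = [0.0] * dim
    for w, i in zip(weights, keep):
        for d in range(dim):
            out[d] += w * values[i][d]
    return out, keep
```

With three sampling points where the third has a low mask probability, only the first two feed the update of the reference point, however large the third point's attention score is.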

Microblogging platforms have seen exponential growth, leading to an abundance of user-generated content. The challenge now is to efficiently extract crucial information from this vast and dispersed text data, which is the goal of our research on Automatic Keyphrase Extraction (AKE) for microblogs. Eye-tracking signals, which reflect users' tendency to prioritize certain words while reading, have been employed to enhance AKE performance on microblogs. However, relying solely on eye-tracking has limitations owing to constraints in physiological mechanism support, acquisition techniques, and feature decoding. Consequently, we propose integrating electroencephalogram (EEG) signals with eye-tracking signals to improve microblog-based AKE and overcome these limitations. Our first step is to identify specific features present in cognitive signals generated during human reading. We selected EEG signals (8 features) and eye-tracking signals (17 features) from the cognitive language processing corpus ZUCO to examine their efficacy when combined with microblog-based AKE. To avoid distortion of the cognitive signals by certain model structures, we introduced these signals at the inputs of the soft attention layer and at the query vectors of the self-attention layer. For evaluation, we performed several AKE tests on microblogs with various combinations of cognitive signals. The results demonstrate that cognitive signals generated during human reading consistently enhance AKE performance, regardless of feature combination and model. Specifically, EEG signals exhibited the most significant improvement. However, combining EEG signals with eye-tracking signals yielded results that fell between the performance levels of the two signal types, indicating that their integration might have some synergistic effects. Further investigation is needed to understand the underlying mechanisms responsible for this outcome. The code and dataset for this paper can be accessed at https://github.com/yan-xinyi/AKE.
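One way to picture introducing cognitive signals "at the query vectors of the self-attention layer" is the sketch below. It is an illustration under stated assumptions, not the authors' model: the function `cognitive_self_attention`, the additive projection matrix `w_cog`, and the toy dimensions are hypothetical, and keys and values are taken directly from the token embeddings for brevity. In the paper's setting, each token's cognitive feature vector would hold the 8 EEG and/or 17 eye-tracking features.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a (rows x cols) matrix by a vector of length cols."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def cognitive_self_attention(embeddings, cog_features, w_cog):
    """Scaled dot-product self-attention with cognitive features added to queries.

    embeddings:   n token vectors of size d (also used as keys and values)
    cog_features: n cognitive feature vectors of size k
    w_cog:        hypothetical d x k projection into the embedding space
    """
    d = len(embeddings[0])
    # Query = token embedding + projected cognitive signal (assumed additive).
    queries = [
        [e + p for e, p in zip(emb, matvec(w_cog, cog))]
        for emb, cog in zip(embeddings, cog_features)
    ]
    outputs, all_weights = [], []
    for q in queries:
        scores = [
            sum(a * b for a, b in zip(q, key)) / math.sqrt(d)
            for key in embeddings
        ]
        weights = softmax(scores)
        all_weights.append(weights)
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, embeddings)) for j in range(d)]
        )
    return outputs, all_weights
```

Because only the queries change, the cognitive signal shifts where each token attends without altering the keys or values that carry the text representation.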

Articles published on Query Vector

Prototype-Based Semantic Segmentation.

Application of End-to-End Perception Framework Based on Boosted DETR in UAV Inspection of Overhead Transmission Lines

Sea–Land Segmentation of Remote-Sensing Images with Prompt Mask-Attention

Indonesian-English Textual Similarity Detection Using Universal Sentence Encoder (USE) and Facebook AI Similarity Search (FAISS)

Decoupled Cross-Modal Transformer for Referring Video Object Segmentation.

Discriminative Feature Enhancement Network for few-shot classification and beyond

HGCTNet: Handcrafted Feature-Guided CNN and Transformer Network for Wearable Cuffless Blood Pressure Measurement.

Extracting Key-phrase Embedding using Deep Average Network and Maximal Marginal Relevance to Enhance Information Retrieval

Adaptive weighted multi-view subspace clustering method for recognizing urban functions from multi-source social sensing data

DBGCN

Deep code search efficiency based on clustering

SeRF: Segment Graph for Range-Filtering Approximate Nearest Neighbor Search

Black-Box Boundary Attack Based on Gradient Optimization

Simple Conditional Spatial Query Mask Deformable Detection Transformer: A Detection Approach for Multi-Style Strokes of Chinese Characters

A SWOT-Based Framework for Personalized Ranking of IoT Service Providers With Generalized Fuzzy Data for Sustainable Transport in Urban Regions

Semi-supervised inverted file index approach for approximate nearest neighbor search

Utilizing cognitive signals generated during human reading to enhance keyphrase extraction from microblogs

Adaptive2Former: Enhancing Chromosome Instance Segmentation with Adaptive Query Decoder

Predicting Pedestrian Trajectories with Deep Adversarial Networks Considering Motion and Spatial Information

Sketch-based semantic retrieval of medical images
