Federated learning enables training models on distributed, privacy-sensitive medical imaging data. However, data heterogeneity across participating institutions degrades model performance and raises fairness concerns, especially for underrepresented datasets. To address these challenges, we propose leveraging the multi-head attention mechanism in Vision Transformers to align the representations of heterogeneous data across clients. By using the attention maps as the alignment objective, our approach aims to improve both the accuracy and the fairness of federated learning models in medical imaging applications. We evaluate our method on the IQ-OTH/NCCD Lung Cancer dataset, simulating varying levels of data heterogeneity with Latent Dirichlet Allocation (LDA) partitioning. Our results show that our approach achieves competitive performance compared with state-of-the-art federated learning methods across heterogeneity levels and improves performance for underrepresented clients, promoting fairness in the federated setting. These findings highlight the potential of the multi-head attention mechanism for addressing data heterogeneity in medical federated learning.
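The abstract does not spell out the form of the alignment objective, so the following is a minimal sketch of one plausible instantiation in PyTorch: each client regularizes its local ViT's per-head attention maps toward those of the frozen global model via a KL term added to the task loss. The `return_attention` flag, the `lambda_align` weight, and the KL formulation are illustrative assumptions, not the paper's confirmed method.

```python
import torch
import torch.nn.functional as F

def attention_alignment_loss(local_attn, global_attn, eps=1e-8):
    """KL divergence between per-head attention maps.

    local_attn, global_attn: tensors of shape
    (batch, num_heads, num_tokens, num_tokens) holding attention
    probabilities from corresponding ViT blocks.
    """
    # F.kl_div expects log-probabilities as input and probabilities
    # as target; "batchmean" averages the divergence over the batch.
    log_local = (local_attn + eps).log()
    return F.kl_div(log_local, global_attn, reduction="batchmean")

def local_update(model, global_model, loader, optimizer, lambda_align=0.1):
    """One client's local training pass: cross-entropy task loss plus
    attention alignment toward the frozen global model.

    Assumes a (hypothetical) `return_attention=True` forward flag that
    yields the stacked attention probabilities alongside the logits.
    """
    global_model.eval()
    for images, labels in loader:
        logits, local_attn = model(images, return_attention=True)
        with torch.no_grad():
            _, global_attn = global_model(images, return_attention=True)
        loss = F.cross_entropy(logits, labels)
        loss = loss + lambda_align * attention_alignment_loss(
            local_attn, global_attn
        )
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```

In this reading, the alignment weight `lambda_align` trades off local task fitting against consistency with the global model's attention, which is what would let underrepresented clients stay close to a shared representation rather than drifting toward their skewed local distributions.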