PSSM-Distil: Protein Secondary Structure Prediction (PSSP) on Low-Quality PSSM by Knowledge Distillation with Contrastive Learning

Qin Wang,Jiaxiang Wu,Zhenlei Xu,Peilin Zhao,Zhen Li,Sheng Wang,Shuguang Cui,Junzhou Huang,Boyuan Wang

doi:10.1609/aaai.v35i1.16141

Abstract

Protein secondary structure prediction (PSSP) is an essential task in computational biology. To achieve the accurate PSSP, the general and vital feature engineering is to use multiple sequence alignment (MSA) for Position-Specific Scoring Matrix (PSSM) extraction. However, when only low-quality PSSM can be obtained due to poor sequence homology, previous PSSP accuracy (merely around 65%) is far from practical usage for subsequent tasks. In this paper, we propose a novel PSSM-Distil framework for PSSP on low-quality PSSM, which not only enhances the PSSM feature at a lower level but also aligns the feature distribution at a higher level. In practice, the PSSM-Distil first exploits the proteins with high-quality PSSM to achieve a teacher network for PSSP in a full-supervised way. Under the guidance of the teacher network, the low-quality PSSM and corresponding student network with low discriminating capacity are effectively resolved by feature enhancement through EnhanceNet and distribution alignment through knowledge distillation with contrastive learning. Further, our PSSM-Distil supports the input from a pre-trained protein sequence language BERT model to provide auxiliary information, which is designed to address the extremely low-quality PSSM cases, i.e., no homologous sequence. Extensive experiments demonstrate the proposed PSSM-Distil outperforms state-of-the-art models on PSSP by 6% on average and nearly 8% in extremely low-quality cases on public benchmarks, BC40 and CB513.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PSSM-Distil: Protein Secondary Structure Prediction (PSSP) on Low-Quality PSSM by Knowledge Distillation with Contrastive Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: May 18, 2021
Citations: 8

Similar Papers

Prior knowledge facilitates low homologous protein secondary structure prediction with DSM distillation.
Qin Wang ... Lenore Cowen
Computer applications in the biosciences : CABIOS | VOL. 38
Qin Wang, et. al.Qin Wang ... Lenore Cowen
02 Jun 2022
Computer applications in the biosciences : CABIOS | VOL. 38

Protein Secondary Structure Prediction Using Local Adaptive Techniques in Training Neural Networks
Lim Eng Aik ... Annie Joseph
-
Lim Eng Aik, et. al.Lim Eng Aik ... Annie Joseph
01 Jan 2008
01 Jan 2008

Parallel protein secondary structure prediction based on neural networks
W Zhong ... X Tian
-
W Zhong, et. al.W Zhong ... X Tian
01 Jan 2004
01 Jan 2004

Combining hydrophobicity with PSSM for protein secondary structure prediction using BP neural network
...
International Journal of Biomedical Engineering | VOL. 31
, et. al. ...
28 Oct 2008
International Journal of Biomedical Engineering | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PSSM-Distil: Protein Secondary Structure Prediction (PSSP) on Low-Quality PSSM by Knowledge Distillation with Contrastive Learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence