Empirical Study of Protein Feature Representation on Deep Belief Networks Trained With Small Data for Secondary Structure Prediction.

Shamima Rashid, Chee Keong Kwoh, Suresh Sundaram

doi:10.1109/tcbb.2022.3168676

Abstract

Protein secondary structure (SS) prediction is a classic problem of computational biology and is widely used in structural characterization and to infer homology. While most SS predictors have been trained on thousands of sequences, a previous approach developed a compact model of training proteins that used an energy-based feature representation derived from the C-Alpha, C-Beta Side Chain (CABS) algorithm. Here, the previous approach is extended to Deep Belief Networks (DBN). Deep learning methods are notorious for requiring large datasets, and there is a wide consensus that training deep models from scratch on small datasets works poorly. By contrast, we demonstrate a simple DBN architecture containing a single hidden layer, trained only on the CB513 dataset. Testing on an independent set of G Switch proteins improved the Q3 score of the previous compact model by almost 3%. The findings are further confirmed by comparison to several deep learning models that are trained on thousands of proteins. Finally, the DBN performance is also compared with a Position Specific Scoring Matrix (PSSM)-profile based feature representation. The importance of (i) structural information in protein feature representation and (ii) complementary small dataset learning approaches for detection of structural fold switching are demonstrated.
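For readers unfamiliar with the metric, the Q3 score is conventionally the percentage of residues whose three-state label (helix H, strand E, coil C) is predicted correctly. The following is a minimal sketch under that assumption; the function name `q3_score` and the example label strings are illustrative only and are not taken from the paper or its data.

```python
def q3_score(predicted: str, observed: str) -> float:
    """Return the percentage of residues whose 3-state label (H, E, C) matches."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have equal length")
    if not observed:
        raise ValueError("sequences must be non-empty")
    # Count positions where the predicted state equals the observed state.
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)


# Hypothetical 10-residue example (not data from the paper):
# one helix residue is mispredicted as coil.
observed = "CHHHHEEEEC"
predicted = "CHHHCEEEEC"
score = q3_score(predicted, observed)
```

Under this definition, an improvement of "almost 3%" would correspond to roughly three more correctly labelled residues per hundred, assuming the figure is in absolute percentage points, which the abstract does not state.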

Journal: IEEE/ACM Transactions on Computational Biology and Bioinformatics
Publication Date: Mar 1, 2023
Citations: 1
License type: CC BY 4.0

Similar Papers

Protein secondary structure prediction by using deep learning method
Yangxu Wang ... Zhang Yi
Knowledge-Based Systems | VOL. 118
17 Nov 2016

Improving Prediction of Protein Secondary Structures using Attention-enhanced Deep Neural Networks
Mukhtar Ahmad Sofi ... M Arif Wani
23 Mar 2022

RiRPSSP: A unified deep learning method for prediction of regular and irregular protein secondary structures
Mukhtar Ahmad Sofi ... M Arif Wani
Journal of Bioinformatics and Computational Biology | VOL. 21
01 Feb 2023

OneHotEncoding and LSTM-based deep learning models for protein secondary structure prediction
Vamsidhar Enireddy ... D. Vijendra Babu
Soft Computing | VOL. 26
12 Feb 2022
