Abstract

The protein secondary structure prediction (PSSP) is pivotal for predicting tertiary structure, which is proliferating in demand for drug design and development. Further, it can be used to learn different protein functions. Although there are many computational methods for protein structure prediction, none of them have succeeded 100% in solving the protein structure prediction problem. This paper proposes a deep learning model, namely Cascaded Feature Learning Model (CFLM) that uses a multi-stage transfer learning approach based on the Residual Dense Network (RDN) for predicting protein secondary structure. The model is trained with different protein datasets at various levels of transfer learning and accepts selected protein features such as solvent accessibility, physicochemical properties, PSSM (position-specific scoring matrix), and PSFM (position-specific frequency matrix) as input. The Q3 and Q8 accuracy obtained on CASP 13 and 14 benchmark datasets with the proposed approach are 91.23% and 91.45%, and 76.83% and 78.04%, respectively. The performance of CFLM is also compared with some of the recent PSSP approaches and it is observed that the proposed approach improves the prediction accuracy by 2.8% in terms of Q3 and 1.81% in terms of Q8 in the CASP 14 dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.