Abstract
Bioinformatics techniques to relative solvent accessibility (RSA) prediction are mostly single-stage approaches; they predict solvent accessibility of proteins by taking into account only the information available in amino acid sequences. We propose to use support vector machines (SVMs) as a second stage following the existing single-stage approaches for RSA prediction problem to improve the accuracy. The purpose of the second stage is to capture the contextual relationship of solvent accessibility elements in a neighborhood in determining the solvent accessibility at a particular site. We demonstrate our approach by introducing SVMs to the output of single-stage SVM classifier. Two-stage SVM approach achieves accuracies up to 90.4% and 90.2% on the Manesh dataset of 215 protein structures and the RS126 dataset of 126 nonhomologous globular proteins, respectively, which are better than the highest reported scores on both datasets to date.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.