Abstract

β-turn is one of the most important reverse turns because of its role in protein folding. Many computational methods have been studied for predicting β-turns and β-turn types. However, due to the imbalanced dataset, the performance is still inadequate. In this study, we proposed a novel over-sampling technique FOST to deal with the class-imbalance problem. Experimental results on three standard benchmark datasets showed that our method is comparable with state-of-the-art methods. In addition, we applied our algorithm to five benchmark datasets from UCI Machine Learning Repository and achieved significant improvement in G-mean and Sensitivity. It means that our method is also effective for various imbalanced data other than β-turns and β-turn types.

Highlights

  • Secondary structure that includes regular and irregular patterns is important in protein folding study because it can be a building block of three-dimensional structures

  • We evaluate the performance of the novel over-sampling algorithm on the five other datasets from UCI Machine Learning Repository

  • In addition to three standard benchmark datasets above, we evaluated the performance of our novel over-sampling algorithm Flexible Over-Sampling Technique (FOST) on the five datasets which were obtained from UCI Machine Learning Repository [34]: Haberman’s Survival, Pima Indian Diabetes, Glass Identification, Landsat Satellite, and Yeast

Read more

Summary

Introduction

Secondary structure that includes regular and irregular patterns is important in protein folding study because it can be a building block of three-dimensional structures. The regular structures, which are sequences of residues with repeating φ and ψ values, are classified in α-helix and β-strand. While this group is well defined, the irregular structures that cover 50% of remaining protein residues are classified as coils. Tight turn is the most important one from the viewpoint of protein structure as well as function [1]. Tight turns are categorized as δ-, γ-, β-, α-, and π-turns according to the number of consecutive residues in the turn

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.