Sorting protein decoys by machine-learning-to-rank.

Xiaoyang Jing,Kai Wang,Ruqian Lu,Qiwen Dong

doi:10.1038/srep31571

Xiaoyang Jing, Kai Wang + Show 2 more

Open Access

https://doi.org/10.1038/srep31571

Copy DOI

Abstract

Much progress has been made in Protein structure prediction during the last few decades. As the predicted models can span a broad range of accuracy spectrum, the accuracy of quality estimation becomes one of the key elements of successful protein structure prediction. Over the past years, a number of methods have been developed to address this issue, and these methods could be roughly divided into three categories: the single-model methods, clustering-based methods and quasi single-model methods. In this study, we develop a single-model method MQAPRank based on the learning-to-rank algorithm firstly, and then implement a quasi single-model method Quasi-MQAPRank. The proposed methods are benchmarked on the 3DRobot and CASP11 dataset. The five-fold cross-validation on the 3DRobot dataset shows the proposed single model method outperforms other methods whose outputs are taken as features of the proposed method, and the quasi single-model method can further enhance the performance. On the CASP11 dataset, the proposed methods also perform well compared with other leading methods in corresponding categories. In particular, the Quasi-MQAPRank method achieves a considerable performance on the CASP11 Best150 dataset.

Highlights

Much progress has been made in Protein structure prediction during the last few decades
As the predicted models can span a broad range of accuracy spectrum, the accuracy of quality estimation becomes one of the key elements of successful protein structure prediction
The Quasi-MQAPRank method achieves a considerable performance on the CASP11 Best[150] dataset

Summary

Introduction

Much progress has been made in Protein structure prediction during the last few decades. Previous studies have found that clustering-based methods generally outperform single-model methods when numerous models are available from several different structure prediction methods[15,16], which is confirmed by the CASP (Critical Assessment of protein Structure Prediction)[4,5,17]. The problem of decoy model quality assessment could be deemed as ranking the decoy models based on their similarities to the corresponding native structure. These similarities can be measured by various structural alignment scoring methods, such as GDT_TS score (global distance test total score)[1], TM-score[25], Max-sub score[26], LGA score[27] etc. In view of its good performance, learning-to-rank methods have been applied in many bioinformatics tasks including disease name normalization[33], biomedical document retrieval[34], gene summary extraction[35], protein folding energy designing[36], etc

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Aug 1, 2016
Citations: 24	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Sorting protein decoys by machine-learning-to-rank.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Deep Learning Methods for Lung Cancer Segmentation in Whole-Slide Histopathology Images-The ACDC@LungHP Challenge 2019.
...
IEEE journal of biomedical and health informatics | VOL. 25
, et. al. ...
20 Nov 2020
Deep Learning Methods for Lung Cancer Segmentation in Whole-Slide Histopathology Images-The ACDC@LungHP Challenge 2019.
...

Improved model quality assessment using ProQ2
Arjun Ray ... Erik Lindahl
BMC Bioinformatics | VOL. 13
Arjun Ray, et. al.Arjun Ray ... Erik Lindahl
10 Sep 2012
BMC Bioinformatics | VOL. 13

Assessment of protein model structure accuracy estimation in CASP14: Old and new challenges.
Sohee Kwon ... Chaok Seok
Proteins | VOL. 89
Sohee Kwon, et. al.Sohee Kwon ... Chaok Seok
05 Aug 2021
Proteins | VOL. 89

ProQ2: estimation of model accuracy implemented in Rosetta.
Karolis Uziela ... Björn Wallner
Bioinformatics | VOL. 32
Karolis Uziela, et. al.Karolis Uziela ... Björn Wallner
05 Jan 2016
Bioinformatics | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sorting protein decoys by machine-learning-to-rank.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports