On simplified global nonlinear function for fitness landscape: a case study of inverse protein folding.

Yun Xu,Jie Liang,Changyu Hu,Yang Dai

doi:10.1371/journal.pone.0104403

Yun Xu, Jie Liang + Show 2 more

Open Access

PDF Available

https://doi.org/10.1371/journal.pone.0104403

Copy DOI

Export

Save

Cite

Journal: PLoS ONE	Publication Date: Aug 11, 2014
Citations: 2	License type: CC BY 4.0

Affiliation: University of Illinois at Chicago

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

The construction of fitness landscape has broad implication in understanding molecular evolution, cellular epigenetic state, and protein structures. We studied the problem of constructing fitness landscape of inverse protein folding or protein design, with the aim to generate amino acid sequences that would fold into an a priori determined structural fold which would enable engineering novel or enhanced biochemistry. For this task, an effective fitness function should allow identification of correct sequences that would fold into the desired structure. In this study, we showed that nonlinear fitness function for protein design can be constructed using a rectangular kernel with a basis set of proteins and decoys chosen a priori. The full landscape for a large number of protein folds can be captured using only 480 native proteins and 3,200 non-protein decoys via a finite Newton method. A blind test of a simplified version of fitness function for sequence design was carried out to discriminate simultaneously 428 native sequences not homologous to any training proteins from 11 million challenging protein-like decoys. This simplified function correctly classified 408 native sequences (20 misclassifications, 95% correct rate), which outperforms several other statistical linear scoring function and optimized linear function. Our results further suggested that for the task of global sequence design of 428 selected proteins, the search space of protein shape and sequence can be effectively parametrized with just about 3,680 carefully chosen basis set of proteins and decoys, and we showed in addition that the overall landscape is not overly sensitive to the specific choice of this set. Our results can be generalized to construct other types of fitness landscape.

Highlights

Protein design has been the focus of many experimental, theoretical, and computational studies [1,2,3,4,5,6,7,8,9]
To obtain such a nonlinear function, our goal is to find a set of parameters faD,aN g such that H(c) has fitness value close to {1 for native proteins, and has fitness values close to z1 for decoys
Because we are unaware of any other development of design fitness functions amenable for high-throughput tests, and frequently no distinctions were made between protein folding potential and protein design fitness function, we compared our fitness function with several well-established scoring functions developed for protein folding

Summary

Introduction

Protein design has been the focus of many experimental, theoretical, and computational studies [1,2,3,4,5,6,7,8,9]. We studied the problem of designing a protein sequence that is compatible with an a priori specified three-dimensional template protein fold. This problem was first formulated 30 years ago [16,17]. Known as the inverse protein folding problem, it addresses the fundamental problem of designing proteins to facilitate engineering of proteins with enhanced or novel biochemical functions. An ideal fitness function can characterize the properties of fitness landscape of many proteins simultaneously. Such a fitness function would be useful for designing novel proteins and novel functions, as well as for studying the global evolution of protein structure and protein functions

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

On simplified global nonlinear function for fitness landscape: a case study of inverse protein folding.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: PLoS ONE

Lead the way for us

Similar Papers

Global Nonlinear Fitness Function for Protein Structures
Yun Xu ... Yang Dai
-
Yun Xu, et. al.Yun Xu ... Yang Dai
01 Jan 2017
01 Jan 2017

Simplified Global Nonlinear Function for Fitness Landscape of Protein Design
Yun Xu ... Jie Liang
Biophysical Journal | VOL. 98
Yun Xu, et. al.Yun Xu ... Jie Liang
01 Jan 2009
Biophysical Journal | VOL. 98

Wright meets AD: not all landscapes are adaptive
M Kirkpatrick ... F Rousset
Journal of Evolutionary Biology | VOL. 18
M Kirkpatrick, et. al.M Kirkpatrick ... F Rousset
25 Aug 2005
Journal of Evolutionary Biology | VOL. 18

Transferable coarse-grained potential for de novo protein folding and design.
Ivan Coluzza
PLoS ONE | VOL. 9
Ivan ColuzzaIvan Coluzza
01 Dec 2014
PLoS ONE | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

On simplified global nonlinear function for fitness landscape: a case study of inverse protein folding.

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: PLoS ONE