Length-dependent prediction of protein intrinsic disorder.

Kang Peng,Predrag Radivojac,A Keith Dunker,Zoran Obradovic,Slobodan Vucetic

doi:10.1186/1471-2105-7-208

Abstract

BackgroundDue to the functional importance of intrinsically disordered proteins or protein regions, prediction of intrinsic protein disorder from amino acid sequence has become an area of active research as witnessed in the 6th experiment on Critical Assessment of Techniques for Protein Structure Prediction (CASP6). Since the initial work by Romero et al. (Identifying disordered regions in proteins from amino acid sequences, IEEE Int. Conf. Neural Netw., 1997), our group has developed several predictors optimized for long disordered regions (>30 residues) with prediction accuracy exceeding 85%. However, these predictors are less successful on short disordered regions (≤30 residues). A probable cause is a length-dependent amino acid compositions and sequence properties of disordered regions.ResultsWe proposed two new predictor models, VSL2-M1 and VSL2-M2, to address this length-dependency problem in prediction of intrinsic protein disorder. These two predictors are similar to the original VSL1 predictor used in the CASP6 experiment. In both models, two specialized predictors were first built and optimized for short (≤30 residues) and long disordered regions (>30 residues), respectively. A meta predictor was then trained to integrate the specialized predictors into the final predictor model. As the 10-fold cross-validation results showed, the VSL2 predictors achieved well-balanced prediction accuracies of 81% on both short and long disordered regions. Comparisons over the VSL2 training dataset via 10-fold cross-validation and a blind-test set of unrelated recent PDB chains indicated that VSL2 predictors were significantly more accurate than several existing predictors of intrinsic protein disorder.ConclusionThe VSL2 predictors are applicable to disordered regions of any length and can accurately identify the short disordered regions that are often misclassified by our previous disorder predictors. The success of the VSL2 predictors further confirmed the previously observed differences in amino acid compositions and sequence properties between short and long disordered regions, and justified our approaches for modelling short and long disordered regions separately. The VSL2 predictors are freely accessible for non-commercial use at

Highlights

Due to the functional importance of intrinsically disordered proteins or protein regions, prediction of intrinsic protein disorder from amino acid sequence has become an area of active research as witnessed in the 6th experiment on Critical Assessment of Techniques for Protein Structure Prediction (CASP6)
Like the structural classification of ordered proteins, e.g. α-helix and β-sheet at the secondary structure level, and all α, all β, α/β and α+β classes at the tertiary structure level, we suggest that there are several subtypes of intrinsic disorder distinguished by amino acid compositions and sequence properties
Comparisons over VSL2 training dataset via 10-fold cross-validation and a blind-test set of unrelated recent PDB chains indicated that VSL2 predictors were significantly more accurate than several existing predictors of intrinsic protein disorder

Summary

Results

We proposed two new predictor models, VSL2-M1 and VSL2-M2, to address this lengthdependency problem in prediction of intrinsic protein disorder. These two predictors are similar to the original VSL1 predictor used in the CASP6 experiment. In both models, two specialized predictors were first built and optimized for short (≤30 residues) and long disordered regions (>30 residues), respectively. As the 10-fold cross-validation results showed, the VSL2 predictors achieved well-balanced prediction accuracies of 81% on both short and long disordered regions. Comparisons over the VSL2 training dataset via 10-fold cross-validation and a blind-test set of unrelated recent PDB chains indicated that VSL2 predictors were significantly more accurate than several existing predictors of intrinsic protein disorder

Conclusion

Background

Results and discussion

Uversky VN

10. Uversky VN

14. Rose GD

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Apr 17, 2006
Citations: 903	License type: cc-by

R Discovery Prime

R Discovery Prime

Length-dependent prediction of protein intrinsic disorder.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Directed mutational scanning reveals a balance between acidic and hydrophobic residues in strong human activation domains.
Max V Staller ... Rohit V Pappu
Cell systems | VOL. 13
Max V Staller, et. al.Max V Staller ... Rohit V Pappu
03 Feb 2022
Cell systems | VOL. 13

DisoFLAG: accurate prediction of protein intrinsic disorder and its functions using graph-based interaction protein language model
Yihe Pang ... Bin Liu
BMC Biology | VOL. 22
Yihe Pang, et. al.Yihe Pang ... Bin Liu
02 Jan 2024
BMC Biology | VOL. 22

The unfoldomics decade: an update on intrinsically disordered proteins.
A Keith Dunker ... Vladimir Vacic
BMC genomics | VOL. Suppl 9 2
A Keith Dunker, et. al.A Keith Dunker ... Vladimir Vacic
01 Jan 2008
BMC genomics | VOL. Suppl 9 2

Predicting intrinsic disorder from amino acid sequence
Zoran Obradovic ... Kang Peng
Proteins: Structure, Function, and Genetics | VOL. 53
Zoran Obradovic, et. al.Zoran Obradovic ... Kang Peng
01 Jan 2003
Proteins: Structure, Function, and Genetics | VOL. 53

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Length-dependent prediction of protein intrinsic disorder.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics