Pfeature: A Tool for Computing Wide Range of Protein Features and Building Prediction Models.

Akshara Pande,Salman Sadullah Usmani,Rajesh Kumar,Gajendra P.S Raghava,Dilraj Kaur,Anjali Dhall,Shipra Jain,Harpreet Kaur,Chakit Arora,Gaurav Mishra,Vinod Kumar,Neelam Sharma,Anjali Lathwal,Sumeet Patiyal,Piyush Agrawal

doi:10.1089/cmb.2022.0241

Abstract

In the last three decades, a wide range of protein features have been discovered to annotate a protein. Numerous attempts have been made to integrate these features in a software package/platform so that the user may compute a wide range of features from a single source. To complement the existing methods, we developed a method, Pfeature, for computing a wide range of protein features. Pfeature allows to compute more than 200,000 features required for predicting the overall function of a protein, residue-level annotation of a protein, and function of chemically modified peptides. It has six major modules, namely, composition, binary profiles, evolutionary information, structural features, patterns, and model building. Composition module facilitates to compute most of the existing compositional features, plus novel features. The binary profile of amino acid sequences allows to compute the fraction of each type of residue as well as its position. The evolutionary information module allows to compute evolutionary information of a protein in the form of a position-specific scoring matrix profile generated using Position-Specific Iterative Basic Local Alignment Search Tool (PSI-BLAST); fit for annotation of a protein and its residues. A structural module was developed for computing of structural features/descriptors from a tertiary structure of a protein. These features are suitable to predict the therapeutic potential of a protein containing non-natural or chemically modified residues. The model-building module allows to implement various machine learning techniques for developing classification and regression models as well as feature selection. Pfeature also allows the generation of overlapping patterns and features from a protein. A user-friendly Pfeature is available as a web server python library and stand-alone package.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Pfeature: A Tool for Computing Wide Range of Protein Features and Building Prediction Models.

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Biology

Lead the way for us

Journal: Journal of Computational Biology	Publication Date: Oct 13, 2022
Citations: 37

Similar Papers

Accelerating iterative protein sequence alignment on a heterogeneous GPU-CPU platform
Mai Said ... Ayman Wahba
-
Mai Said, et. al.Mai Said ... Ayman Wahba
01 Jul 2016
01 Jul 2016

Improving the Prediction of Protein Structural Class for Low-Similarity Sequences by Incorporating Evolutionaryand Structural Information
Liang Kong ... Rong Jing
Journal of Advanced Computational Intelligence and Intelligent Informatics | VOL. 20
Liang Kong, et. al.Liang Kong ... Rong Jing
19 May 2016
Journal of Advanced Computational Intelligence and Intelligent Informatics | VOL. 20

High performance computing workflow for protein functional annotation
Larissa Stanberry ... Bhanu Rekepalli
-
Larissa Stanberry, et. al.Larissa Stanberry ... Bhanu Rekepalli
22 Jul 2013
22 Jul 2013

Development of a sugar-binding residue prediction system from protein sequences using support vector machine
Masaki Banno ... Kentaro Shimizu
Computational Biology and Chemistry | VOL. 66
Masaki Banno, et. al.Masaki Banno ... Kentaro Shimizu
09 Nov 2016
Computational Biology and Chemistry | VOL. 66

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Pfeature: A Tool for Computing Wide Range of Protein Features and Building Prediction Models.

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Biology