Prediction Of Functional Sites Research Articles

Model quality assessments via computational methods which entail comparisons of the modeled structures to the experimentally determined structures are essential in the field of protein structure prediction. The assessments provide means to benchmark the accuracies of the modeling techniques and to aid with their development. We previously described the ResiRole method to gauge model quality principally based on the preservation of the structural characteristics described in SeqFEATURE functional site prediction models. We apply ResiRole to benchmark modeling group performances in the Critical Assessment of Structure Prediction experiment, round 15. To gauge model quality, a normalized Predicted Functional site Similarity Score (PFSS) was calculated as the average of one minus the absolute values of the differences of the functional site prediction probabilities, as found for the experimental structures versus those found at the corresponding sites in the structure models. The average PFSS per modeling group (gPFSS) correlates with standard quality metrics, and can effectively be used to rank the accuracies of the groups. For the free modeling (FM) category, correlation coefficients of the Local Distance Difference Test (LDDT) and Global Distance Test-Total Score (GDT-TS) metrics with gPFSS were 0.98239 and 0.87691, respectively. An example finding for a specific group is that the gPFSS for EMBER3D was higher than expected based on the predictive relationship between gPFSS and LDDT. We infer the result is due to the use of constraints imprinted by function that are a part of the EMBER3D methodology. Also, we find functional site predictions that may guide further functional characterizations of the respective proteins. The gPFSS metric provides an effective means to assess and rank the performances of the structure prediction techniques according to their abilities to accurately recount the structural features at predicted functional sites.

Read full abstract

Biological sequence analysis is an important basic research work in the field of bioinformatics. With the explosive growth of data, machine learning methods play an increasingly important role in biological sequence analysis. By constructing a classifier for prediction, the input sequence feature vector is predicted and evaluated, and the knowledge of gene structure, function and evolution is obtained from a large amount of sequence information, which lays a foundation for researchers to carry out in-depth research. At present, many machine learning methods have been applied to biological sequence analysis such as RNA gene recognition and protein secondary structure prediction. As a biological sequence, RNA plays an important biological role in the encoding, decoding, regulation and expression of genes. The analysis of RNA data is currently carried out from the aspects of structure and function, including secondary structure prediction, non-coding RNA identification and functional site prediction. Pseudouridine (У) is the most widespread and rich RNA modification and has been discovered in a variety of RNAs. It is highly essential for the study of related functional mechanisms and disease diagnosis to accurately identify У sites in RNA sequences. At present, several computational approaches have been suggested as an alternative to experimental methods to detect У sites, but there is still potential for improvement in their performance. In this study, we present a model based on twin support vector machine (TWSVM) for У site identification. The model combines a variety of feature representation techniques and uses the max-relevance and min-redundancy methods to obtain the optimum feature subset for training. The independent testing accuracy is improved by 3.4% in comparison to current advanced У site predictors. The outcomes demonstrate that our model has better generalization performance and improves the accuracy of У site identification. iPseU-TWSVM can be a helpful tool to identify У sites.

Read full abstract

Prediction Of Functional Sites Research Articles

Related Topics

Articles published on Prediction Of Functional Sites

Assessment of the Performances of the Protein Modeling Techniques Participating in CASP15 Using a Structure-Based Functional Site Prediction Approach: ResiRole.

COLLAPSE: A representation learning framework for identification and characterization of protein structural sites.

Alignment-free estimation of sequence conservation for identifying functional sites using protein sequence embeddings.

ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction.

IPseU-TWSVM: Identification of RNA pseudouridine sites based on TWSVM.

Purification, characterization and functional site prediction of the vaccinia-related kinase 2A small transmembrane domain

InDeep: 3D fully convolutional neural networks to assist in silico drug design on protein-protein interactions.

TFBSPred: A functional transcription factor binding site prediction webtool for humans and mice

CATH functional families predict functional sites in proteins.

The ResiRole server to enable assessments of structure prediction techniques using functional site predictions

ResiRole: residue-level functional site predictions to gauge the accuracies of protein structure prediction techniques.

An Evolutionary Trace method defines functionally important bases and sites common to RNA families.

PIRSitePredict for protein functional site prediction using position-specific rules.

Factor cooperation for chromosome discrimination in Drosophila.

Applying Knowledge of Enzyme Biochemistry to the Prediction of Functional Sites for Aiding Drug Discovery.

A comprehensive software suite for protein family construction and functional site prediction.

Designing and in Silico Analysis of PorB Protein from Chlamydia Trachomatis for Developing a Vaccine Candidate.

Evaluation of free modeling targets in CASP11 and ROLL.

Protein Functional Site Prediction Using a Conservative Grade and a Proximate Grade

Functional classification of CATH superfamilies: a domain-based approach for protein function annotation.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Prediction Of Functional Sites Research Articles

Related Topics

Articles published on Prediction Of Functional Sites

Assessment of the Performances of the Protein Modeling Techniques Participating in CASP15 Using a Structure-Based Functional Site Prediction Approach: ResiRole.

COLLAPSE: A representation learning framework for identification and characterization of protein structural sites.

Alignment-free estimation of sequence conservation for identifying functional sites using protein sequence embeddings.

ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction.

IPseU-TWSVM: Identification of RNA pseudouridine sites based on TWSVM.

Purification, characterization and functional site prediction of the vaccinia-related kinase 2A small transmembrane domain

InDeep: 3D fully convolutional neural networks to assist in silico drug design on protein-protein interactions.

TFBSPred: A functional transcription factor binding site prediction webtool for humans and mice

CATH functional families predict functional sites in proteins.

The ResiRole server to enable assessments of structure prediction techniques using functional site predictions

ResiRole: residue-level functional site predictions to gauge the accuracies of protein structure prediction techniques.

An Evolutionary Trace method defines functionally important bases and sites common to RNA families.

PIRSitePredict for protein functional site prediction using position-specific rules.

Factor cooperation for chromosome discrimination in Drosophila.

Applying Knowledge of Enzyme Biochemistry to the Prediction of Functional Sites for Aiding Drug Discovery.

A comprehensive software suite for protein family construction and functional site prediction.

Designing and in Silico Analysis of PorB Protein from Chlamydia Trachomatis for Developing a Vaccine Candidate.

Evaluation of free modeling targets in CASP11 and ROLL.

Protein Functional Site Prediction Using a Conservative Grade and a Proximate Grade

Functional classification of CATH superfamilies: a domain-based approach for protein function annotation.