Abstract

The prediction of contact maps in protein is a challenging topic for the determination of three-dimensional protein structures. In this paper, we introduce Forest of Decision Trees, a methodology for the prediction of protein contact maps based on (1) a divide-and-conquer approach to analyze the prediction problem; (2) a codification vector that combines the information obtained from the target amino acids neighborhood, and the sub-sequence between them; (3) an ensemble of classifiers that employs a hybrid of Genetic Algorithms and Decision Trees as base classifiers; and (4) a rulebased interpretation mechanism. The comparison against the top sequence-based methods in CASP10 showed that our predictor is very competitive, showing a high reliability. Their main advantage is its capability to generate a humancomprehensible rule-based interpretation mechanism, giving the specialist some clues to find an easier and interpretable solution for the protein-folding recognition and the prediction of unknown structures. Keywords: CASP10, contact maps prediction, decision trees, genetic algorithms, multiple classifier systems, protein structure prediction.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.