Abstract

Theoretic study in this paper shows that we can obtain exact long-range contacts by adopting one classifier if the centers of sequence profiles of residue pairs for long-range contacts and non-long-range contacts are known. The adopted classifier, referred to as multiple conditional probability mass function classifier (MCPMFC), can find an optimized transformation of the variables for each of the classes and therefore resulting in K separate classifiers. As a result, about 44.48% long-range contacts are around at the sequence profile (SP) centre for long-range contacts and about 20.9% long-range contacts are correctly predicted when considering the top L/5 (L is the protein sequence length) predicted contacts and the residue pair with 24 apart. The highest cluster result gives us a clue that SP center should be a sound pathway to investigate contact map in protein structures.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call