Abstract

In this study, the dipeptide composition of 3 216 thermophilic and 4 007 mesophilic protein sequences was systematically analyzed. It was found that the thermophilic proteins contained larger number of dipeptides such as EE, EK, KE, VE, EI, KI, EV, KK, VK and IE, and smaller number of dipeptides such as AA, LL, LA, AL, QA, QL, AQ, LT, TL and EQ. Hence, a statistical method was developed for the discrimination of thermophilic and mesophilic proteins. The method that was developed picked up the thermophilic proteins with an accuracy of 94.0 % and 89 %, respectively, for the testing sets of 382 and 73 thermophilic proteins. The accuracy for mesophilic proteins was 85.2 % and 89 %, respectively, for the testing sets of 325 and 73 mesophilic proteins. The influence of specific dipeptides on discrimination was also discussed.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.