Abstract

In computational biology, the Protein Remote homology Detection technique (PRHD) has got undeniable significance. It is mostly important for structure and function identification of a protein sequence. The previous years have seen a challenge that lacks postulating a correlation among the sequences. However, the sequences are of variable length. Thereby, it inhibits the proper derivation of evolutionary information among the sequences. The challenges are the usage of physico-chemical properties as a source to get the evolutionary information and the number of sequences generated every day. This however facilitates a new technique to integrate huge amount of data with a massive feature set. In this article, a new and efficient technique is proposed to predict homology for distantly located sequences of proteins. Deep neural network(CNN-GRU model) is used for the classification of the protein sequences. This is based on different protein families and methods of feature extraction.The efficiency of the proposed model DeepRHD is tested on average 8000 sequences per superfamily taken from SCOP benchmark dataset and the results shows that the proposed model is better than other state of art methods. This model is useful in detecting diseases like sickle cell anemia and influenza and developing a drug thereafter.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call