Abstract
Globular proteins (GPs) play vital roles in a wide range of biological processes, encompassing enzymatic catalysis and immune responses. Enzymes, among these globular proteins, facilitate biochemical reactions, while others, such as haemoglobin, contribute to essential physiological functions such as oxygen transport. Given the importance of these considerations, accurately identifying Globular proteins is essential. To address the need for precise GP identification, this research introduces an innovative approach that employs a hybrid-based deep learning model called Deep-GP. We generated two datasets based on primary sequences and developed a novel feature descriptor called, Consensus Sequence-based Trisection-Position Specific Scoring Matrix (CST-PSSM). The model training phase involved the application of deep learning techniques, including the bidirectional long short-term memory network (BiLSTM), gated recurrent unit (GRU), and convolutional neural network (CNN). The BiLSTM and CNN were hybridised for ensemble learning. The CST-PSSM-based ensemble model achieved the most accurate predictive outcomes, outperforming other competitive predictors across both training and testing datasets. This demonstrates the potential of harnessing deep learning for precise GB prediction as a robust tool to expedite research, streamline drug discovery, and unveil novel therapeutic targets.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.