CLPred: a sequence-based protein crystallization predictor using BLSTM neural network.

Wenjing Xuan,Jianxin Wang,Ning Liu,Yaohang Li,Neng Huang

doi:10.1093/bioinformatics/btaa791

Abstract

Determining the structures of proteins is a critical step to understand their biological functions. Crystallography-based X-ray diffraction technique is the main method for experimental protein structure determination. However, the underlying crystallization process, which needs multiple time-consuming and costly experimental steps, has a high attrition rate. To overcome this issue, a series of in silico methods have been developed with the primary aim of selecting the protein sequences that are promising to be crystallized. However, the predictive performance of the current methods is modest. We propose a deep learning model, so-called CLPred, which uses a bidirectional recurrent neural network with long short-term memory (BLSTM) to capture the long-range interaction patterns between k-mers amino acids to predict protein crystallizability. Using sequence only information, CLPred outperforms the existing deep-learning predictors and a vast majority of sequence-based diffraction-quality crystals predictors on three independent test sets. The results highlight the effectiveness of BLSTM in capturing non-local, long-range inter-peptide interaction patterns to distinguish proteins that can result in diffraction-quality crystals from those that cannot. CLPred has been steadily improved over the previous window-based neural networks, which is able to predict crystallization propensity with high accuracy. CLPred can also be improved significantly if it incorporates additional features from pre-extracted evolutional, structural and physicochemical characteristics. The correctness of CLPred predictions is further validated by the case studies of Sox transcription factor family member proteins and Zika virus non-structural proteins. https://github.com/xuanwenjing/CLPred.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CLPred: a sequence-based protein crystallization predictor using BLSTM neural network.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Journal: Bioinformatics	Publication Date: Dec 29, 2020
Citations: 14

Similar Papers

Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy.
Keshav Aditya R Premkumar ... Ramit Bharanikumar
Frontiers in bioengineering and biotechnology | VOL. 8
Keshav Aditya R Premkumar, et. al.Keshav Aditya R Premkumar ... Ramit Bharanikumar
14 Jul 2020
Frontiers in bioengineering and biotechnology | VOL. 8

DeepCrystal: A Deep Learning Framework for Sequence-based Protein Crystallization Prediction
Abdurrahman Elbasir ... Halima Bensmail
-
Abdurrahman Elbasir, et. al.Abdurrahman Elbasir ... Halima Bensmail
01 Dec 2018
01 Dec 2018

DeepCrystal: a deep learning framework for sequence-based protein crystallization prediction.
Abdurrahman Elbasir ... Prasanna R Kolatkar
Bioinformatics | VOL. 35
Abdurrahman Elbasir, et. al.Abdurrahman Elbasir ... Prasanna R Kolatkar
21 Nov 2018
Bioinformatics | VOL. 35

Evaluating the Performance of Various Deep Reinforcement Learning Algorithms for a Conversational Chatbot
R Rajamalli Keerthana ... G Fathima
-
R Rajamalli Keerthana, et. al.R Rajamalli Keerthana ... G Fathima
21 May 2021
21 May 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CLPred: a sequence-based protein crystallization predictor using BLSTM neural network.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics