DeepCrystal: A Deep Learning Framework for Sequence-based Protein Crystallization Prediction

Abdurrahman Elbasir,Prasanna R Kolatkar,Balasubramanian Moovarkumudalvan,Khalid Kunji,Raghvendra Mall,Halima Bensmail

doi:10.1109/bibm.2018.8621202

Abstract

Protein structure determination has primarily been performed using X-ray crystallography. To overcome the expensive cost, high attrition rate and series of trial-and-error settings, many in-silico methods have been developed to predict crystallization propensities of proteins based on their sequences. However, majority of these methods build predictors by extracting features from protein sequences which is computationally expensive and can potentially explode the feature space. We propose, DeepCrystal, a deep learning framework for sequence-based protein crystallization prediction. It uses deep learning to identify proteins which can produce diffraction quality crystals without the need to manually engineer additional biochemical and structural features from sequence. Our model is based on Convolutional Neural Networks (CNNs) which can exploit frequently occurring k-mers and sets of k-mers from the protein sequences to discriminate diffraction quality crystals from non-crystallizable ones. Our model outperforms previous sequence-based protein crystallization predictors in terms of recall, F-score, accuracy and MCC on three independent test sets. DeepCrystal achieves an average improvement of 1.4 %, 12.1% in recall, when compared to its closest competitors, Crysalis II and Crysf respectively. In addition, DeepCrystal attains an average improvement of 2.1%, 6.0% for F-score, 1.9%, 3.9% for accuracy and 3.8%, 7.0% for MCC respectively w.r.t. Crysalis II and Crysf on independent test sets. The standalone source code and models are available at https://github.com/elbasir/DeepCrystal and a web-server is also available at https://deeplearning-protein.qcri.org.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DeepCrystal: A Deep Learning Framework for Sequence-based Protein Crystallization Prediction

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

DeepCrystal: a deep learning framework for sequence-based protein crystallization prediction.
Abdurrahman Elbasir ... Prasanna R Kolatkar
Bioinformatics | VOL. 35
Abdurrahman Elbasir, et. al.Abdurrahman Elbasir ... Prasanna R Kolatkar
21 Nov 2018
Bioinformatics | VOL. 35

CLPred: a sequence-based protein crystallization predictor using BLSTM neural network.
Wenjing Xuan ... Neng Huang
Bioinformatics | VOL. 36
Wenjing Xuan, et. al.Wenjing Xuan ... Neng Huang
29 Dec 2020
Bioinformatics | VOL. 36

Deep Learning Improves Speed and Accuracy of Prostate Gland Segmentations on Magnetic Resonance Imaging for Targeted Biopsy.
Simon John Christoph Soerensen ... Geoffrey A Sonn
Journal of Urology | VOL. 206
Simon John Christoph Soerensen, et. al.Simon John Christoph Soerensen ... Geoffrey A Sonn
21 Apr 2021
Journal of Urology | VOL. 206

AI‐BRAFV600E: A deep convolutional neural network for BRAFV600E mutation status prediction of thyroid nodules using ultrasound images
Chuang Xi ... Xuan Zheng
VIEW | VOL. 4
Chuang Xi, et. al.Chuang Xi ... Xuan Zheng
16 Jan 2023
VIEW | VOL. 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DeepCrystal: A Deep Learning Framework for Sequence-based Protein Crystallization Prediction

Abstract

Talk to us

Similar Papers