Identification Of DNA-binding Proteins Research Articles

BackgroundDNA-binding proteins are vital for the study of cellular processes. In recent genome engineering studies, the identification of proteins with certain functions has become increasingly important and needs to be performed rapidly and efficiently. In previous years, several approaches have been developed to improve the identification of DNA-binding proteins. However, the currently available resources are insufficient to accurately identify these proteins. Because of this, the previous research has been limited by the relatively unbalanced accuracy rate and the low identification success of the current methods.ResultsIn this paper, we explored the practicality of modelling DNA binding identification and simultaneously employed an ensemble classifier, and a new predictor (nDNA-Prot) was designed. The presented framework is comprised of two stages: a 188-dimension feature extraction method to obtain the protein structure and an ensemble classifier designated as imDC. Experiments using different datasets showed that our method is more successful than the traditional methods in identifying DNA-binding proteins. The identification was conducted using a feature that selected the minimum Redundancy and Maximum Relevance (mRMR). An accuracy rate of 95.80% and an Area Under the Curve (AUC) value of 0.986 were obtained in a cross validation. A test dataset was tested in our method and resulted in an 86% accuracy, versus a 76% using iDNA-Prot and a 68% accuracy using DNA-Prot.ConclusionsOur method can help to accurately identify DNA-binding proteins, and the web server is accessible at http://datamining.xmu.edu.cn/~songli/nDNA. In addition, we also predicted possible DNA-binding protein sequences in all of the sequences from the UniProtKB/Swiss-Prot database.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2105-15-298) contains supplementary material, which is available to authorized users.

Read full abstract

DNA-binding proteins (DBPs), such as transcription factors, constitute about 10% of the protein-coding genes in eukaryotic genomes and play pivotal roles in the regulation of chromatin structure and gene expression by binding to short stretches of DNA. Despite their number and importance, only for a minor portion of DBPs the binding sequence had been disclosed. Methods that allow the de novo identification of DNA-binding motifs of known DBPs, such as protein binding microarray technology or SELEX, are not yet suited for high-throughput and automation. To close this gap, we report an automatable DNA-protein-interaction (DPI)-ELISA screen of an optimized double-stranded DNA (dsDNA) probe library that allows the high-throughput identification of hexanucleotide DNA-binding motifs. In contrast to other methods, this DPI-ELISA screen can be performed manually or with standard laboratory automation. Furthermore, output evaluation does not require extensive computational analysis to derive a binding consensus. We could show that the DPI-ELISA screen disclosed the full spectrum of binding preferences for a given DBP. As an example, AtWRKY11 was used to demonstrate that the automated DPI-ELISA screen revealed the entire range of in vitro binding preferences. In addition, protein extracts of AtbZIP63 and the DNA-binding domain of AtWRKY33 were analyzed, which led to a refinement of their known DNA-binding consensi. Finally, we performed a DPI-ELISA screen to disclose the DNA-binding consensus of a yet uncharacterized putative DBP, AtTIFY1. A palindromic TGATCA-consensus was uncovered and we could show that the GATC-core is compulsory for AtTIFY1 binding. This specific interaction between AtTIFY1 and its DNA-binding motif was confirmed by in vivo plant one-hybrid assays in protoplasts. Thus, the value and applicability of the DPI-ELISA screen for de novo binding site identification of DBPs, also under automatized conditions, is a promising approach for a deeper understanding of gene regulation in any organism of choice.

Read full abstract

Identification Of DNA-binding Proteins Research Articles

Related Topics

Articles published on Identification Of DNA-binding Proteins

NDNA-Prot: identification of DNA-binding proteins based on unbalanced classification.

EnDNA-Prot: identification of DNA-binding proteins by applying ensemble learning.

Screening for Protein-DNA Interactions by Automatable DNA-Protein Interaction ELISA

Identification of DNA-Binding and Protein-Binding Proteins Using Enhanced Graph Wavelet Features

Identification of centromeric and telomeric DNA-binding proteins in rice

Identification of DNA-Binding Proteins Using Support Vector Machine with Sequence Information

Computational Methods for DNA-binding Protein and Binding Residue Prediction

Toward detection of DNA-bound proteins using solid-state nanopores: insights from computer simulations.

Unbiased Discovery of Interactions at a Control Locus Driving Expression of the Cancer-Specific Therapeutic and Diagnostic Target, Mesothelin

Identification of novel DNA-binding proteins using DNA-affinity chromatography/pull down.

Identification of essential and non-essential single-stranded DNA-binding proteins in a model archaeal organism

IDNA-Prot: Identification of DNA Binding Proteins Using Random Forest with Grey Model

IDBPs: a web server for the identification of DNA binding proteins

Understanding Protein-DNA Interactions through Dynamics

Global Analysis of a Plasmid-Cured Shigella flexneri Strain: New Insights into the Interaction between the Chromosome and a Virulence Plasmid

DNA-Prot: Identification of DNA Binding Proteins from Protein Sequence Information using Random Forest

Identification of DNA-binding Proteins Using Structural, Electrostatic and Evolutionary Features

Exploring DNA-Binding Proteins with In Vivo Chemical Cross-Linking and Mass Spectrometry

A Proteomics Approach for Identification of Single Strand DNA-binding Proteins Involved in Transcriptional Regulation of Mouse μ Opioid Receptor Gene

Identification of DNA-binding proteins on human umbilical vein endothelial cell plasma membrane.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Identification Of DNA-binding Proteins Research Articles

Related Topics

Articles published on Identification Of DNA-binding Proteins

NDNA-Prot: identification of DNA-binding proteins based on unbalanced classification.

EnDNA-Prot: identification of DNA-binding proteins by applying ensemble learning.

Screening for Protein-DNA Interactions by Automatable DNA-Protein Interaction ELISA

Identification of DNA-Binding and Protein-Binding Proteins Using Enhanced Graph Wavelet Features

Identification of centromeric and telomeric DNA-binding proteins in rice

Identification of DNA-Binding Proteins Using Support Vector Machine with Sequence Information

Computational Methods for DNA-binding Protein and Binding Residue Prediction

Toward detection of DNA-bound proteins using solid-state nanopores: insights from computer simulations.

Unbiased Discovery of Interactions at a Control Locus Driving Expression of the Cancer-Specific Therapeutic and Diagnostic Target, Mesothelin

Identification of novel DNA-binding proteins using DNA-affinity chromatography/pull down.

Identification of essential and non-essential single-stranded DNA-binding proteins in a model archaeal organism

IDNA-Prot: Identification of DNA Binding Proteins Using Random Forest with Grey Model

IDBPs: a web server for the identification of DNA binding proteins

Understanding Protein-DNA Interactions through Dynamics

Global Analysis of a Plasmid-Cured Shigella flexneri Strain: New Insights into the Interaction between the Chromosome and a Virulence Plasmid

DNA-Prot: Identification of DNA Binding Proteins from Protein Sequence Information using Random Forest

Identification of DNA-binding Proteins Using Structural, Electrostatic and Evolutionary Features

Exploring DNA-Binding Proteins with In Vivo Chemical Cross-Linking and Mass Spectrometry

A Proteomics Approach for Identification of Single Strand DNA-binding Proteins Involved in Transcriptional Regulation of Mouse μ Opioid Receptor Gene

Identification of DNA-binding proteins on human umbilical vein endothelial cell plasma membrane.