Structural analysis of regulatory DNA sequences using grammar inference and Support Vector Machine

Robertas Damaševičius

doi:10.1016/j.neucom.2009.09.018

Abstract

Regulatory DNA sequences such as promoters or splicing sites control gene expression and are important for successful gene prediction. Such sequences can be recognized by certain patterns or motifs that are conserved within a species. These patterns have many exceptions which makes the structural analysis of regulatory sequences a complex problem. Grammar rules can be used for describing the structure of regulatory sequences; however, the manual derivation of such rules is not trivial. In this paper, stochastic L-grammar rules are derived automatically from positive examples and counterexamples of regulatory sequences using genetic programming techniques. The fitness of grammar rules is evaluated using a Support Vector Machine (SVM) classifier. SVM is trained on known sequences to obtain a discriminating function which serves for evaluating a candidate grammar ruleset by determining the percentage of generated sequences that are classified correctly. The combination of SVM and grammar rule inference can mitigate the lack of structural insight in machine learning approaches such as SVM.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Structural analysis of regulatory DNA sequences using grammar inference and Support Vector Machine

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Nov 22, 2009
Citations: 26

Similar Papers

Structural Analysis of Promoter Sequences Using Grammar Inference and Support Vector Machine
Robertas Damaševičius
-
Robertas DamaševičiusRobertas Damaševičius
03 Sep 2008
03 Sep 2008

Optimization of SVM parameters for recognition of regulatory DNA sequences
Robertas Damaševičius
TOP | VOL. 18
Robertas DamaševičiusRobertas Damaševičius
25 Aug 2010
TOP | VOL. 18

Analysis of Plant Regulatory DNA Sequences by Transient Protoplast Assays and Computer Aided Sequence Evaluation
Kenneth W Berendzen ... Dierk Wanke
-
Kenneth W Berendzen, et. al.Kenneth W Berendzen ... Dierk Wanke
01 Jan 2009
01 Jan 2009

Analysis of Plant Regulatory DNA sequences by the Yeast-One-Hybrid Assay
Dierk Wanke ... Klaus Harter
-
Dierk Wanke, et. al.Dierk Wanke ... Klaus Harter
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Structural analysis of regulatory DNA sequences using grammar inference and Support Vector Machine

Abstract

Talk to us

Similar Papers

More From: Neurocomputing