Identification of 5’UTR Splicing Site Using Sequence and Structural Specificities Based on Combination Statistical Method with SVM

Lv Jun-Jie, Xiong Xin-Yan, Feng Wei-Xing, Wang Xin, Wang Ke-Jun

doi:10.12785/amis/071l14

Abstract

To identify splice sites in untranslated regions (UTRs) more accurately and efficiently, a recognition method was proposed that uses both the splice-site sequences and the secondary structures of their flanking sequences, combining a statistical method with a support vector machine (SVM). The method consists of two stages: a statistical method is used in the first stage, and an SVM with a polynomial kernel is used in the second. The statistical method serves as a pre-processing step for the SVM and takes UTR sequences as its input. It models the compositional features and dependencies of nucleotides around splice-site regions in terms of probabilistic parameters. These parameters are then fed into the SVM, which combines them nonlinearly to predict splice sites. In addition, the Mfold package in the Vienna software was used to predict the most stable secondary structure of the flanking sequences, and the traditional four-letter alphabet sequence was converted into an eight-letter alphabet sequence. The resulting sequence-structure combination strings were used to train the models, and the trained models were then used to recognize splice sites. The method was tested on an actual dataset of 5'UTR splice sites from human genes and shows good performance in recognizing UTR splice sites.
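The abstract does not say how the eight-letter alphabet is defined or which probabilistic parameters the first stage produces, so the following is only a minimal sketch under stated assumptions. It assumes the structure is given in dot-bracket notation, that paired bases map to uppercase letters and unpaired bases to lowercase letters, and that the first-stage parameters are simple per-position symbol probabilities with a pseudocount. The function names `encode_eight_letter`, `position_probabilities`, and `feature_vector` are hypothetical and do not come from the paper.

```python
from collections import Counter

EIGHT_LETTERS = "ACGUacgu"  # assumed alphabet: uppercase = paired, lowercase = unpaired


def encode_eight_letter(seq, structure):
    """Merge a nucleotide sequence with its dot-bracket structure.

    Assumed mapping (not from the paper): a base under '(' or ')' stays
    uppercase, a base under '.' becomes lowercase.
    """
    if len(seq) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    encoded = []
    for base, mark in zip(seq.upper().replace("T", "U"), structure):
        if base not in "ACGU":
            raise ValueError(f"unexpected nucleotide: {base!r}")
        if mark not in "().":
            raise ValueError(f"unexpected structure symbol: {mark!r}")
        encoded.append(base if mark in "()" else base.lower())
    return "".join(encoded)


def position_probabilities(strings, pseudocount=1.0):
    """Estimate per-position symbol probabilities from equal-length strings."""
    if not strings:
        raise ValueError("at least one training string is required")
    length = len(strings[0])
    if any(len(s) != length for s in strings):
        raise ValueError("all training strings must have the same length")
    total = len(strings) + pseudocount * len(EIGHT_LETTERS)
    profile = []
    for i in range(length):
        counts = Counter(s[i] for s in strings)
        profile.append(
            {a: (counts[a] + pseudocount) / total for a in EIGHT_LETTERS}
        )
    return profile


def feature_vector(string, profile):
    """Look up the probability of each observed symbol at its position."""
    return [profile[i][symbol] for i, symbol in enumerate(string)]


if __name__ == "__main__":
    combined = encode_eight_letter("GGGAAACCC", "(((...)))")
    profile = position_probabilities([combined])
    print(combined)
    print(feature_vector(combined, profile))
```

Vectors like the one returned by `feature_vector` could then be passed to any SVM implementation configured with a polynomial kernel; that second stage is left out here to keep the sketch free of third-party dependencies.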

Journal: Applied Mathematics & Information Sciences
Publication Date: Feb 1, 2013
Citations: 10

Similar Papers

Splice site identification using probabilistic parameters and SVM classification.
Akma Baten ... Sk Halgamuge
BMC Bioinformatics | VOL. Suppl 7 5
01 Dec 2006

Role of the branch site/3'-splice site region in adenovirus-2 E1A pre-mRNA alternative splicing: evidence for 5'- and 3'-splice site co-operation.
Per Johan Ulfendahl ... Goran Akusjärvi
Nucleic acids research | VOL. 17
01 Jan 1989

Effect of growth hormone on levels of differentially processed insulin-like growth factor I mRNAs in total and polysomal mRNA populations.
H L Foyt ... M Woloschak
Molecular endocrinology (Baltimore, Md.) | VOL. 6
01 Nov 1992

Extensive interactions of PRP8 protein with the 5' and 3' splice sites during splicing suggest a role in stabilization of exon alignment by U5 snRNA.
S Teigelkamp ... A.J Newman
The EMBO Journal | VOL. 14
01 Jun 1995
