Overlapping translation of nucleic acid sequences for bioinformatics applications

Jan Charles Biro

doi:10.1016/s0306-9877(03)00008-2

Abstract

Summary: An alternative method to TblastX has been developed. Nucleic acids in database and query sequences were translated into overlapping protein-like sequences (overlappingly translated sequences or OTSs) before searching with BlastP. Thus, each nucleic acid sequences is represented by a single ‘protein like’ sequence instead of three ‘proteins’ in different reading frames. The 3×3 comparison of TblastX is represented by a single comparison, giving faster results. Additional advantages are: (1) it can be more sensitive to detect weak sequence similarities than either blastN or TblastX; (2) codon redundancy is eliminated; (3) the sensitivity to single nucleotide polymorphism, mutation and sequencing errors is reduced; (4) it is insensitive to frame shifts. Results: BlastP using OTS detected about two thirds of blastN and TblastX matches but discovered additional similarities. When blastN and TblastX against nucleic acids were compared to blastP against OTS, identical matches discovered by blastP were generally longer (602, respectively. 213 letters, p<0.01), had higher scores (748 respectively 460 bits, p<0.05) and lower E values (3.16E − 20 vs. 1.17E + 03, p<0.01) but the percentage identity was lower (25% respectively 61%, p<0.001). A qualitative evaluation with LALIGN showed an improvement of the visualization when OTS-s were used instead of nucleic acids. Many extensive sequence similarities became better visible, for example the repeating similarity between prion protein and human insulin gene micro-satellite, and the surprising similarity between the first part of prion protein coding region and the human pro-insulin (34.4% identity and additional 17.2% similarity through 238 residues, score >295 which is expected 4.6e − 18 times by chance).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Overlapping translation of nucleic acid sequences for bioinformatics applications

Abstract

Talk to us

Similar Papers

More From: Medical Hypotheses

Lead the way for us

Journal: Medical Hypotheses	Publication Date: Mar 6, 2003
Citations: 5

Similar Papers

A novel sequence similarity searching and visualization method based on overlappingly translated nucleic acids: the blastNP
Jan C Biro ... Josephine M.K Biro
Medical Hypotheses | VOL. 62
Jan C Biro, et. al.Jan C Biro ... Josephine M.K Biro
03 Feb 2004
Medical Hypotheses | VOL. 62

Deadly Conformations—Protein Misfolding in Prion Disease
Arthur L Horwich ... Jonathan S Weissman
Cell | VOL. 89
Arthur L Horwich, et. al.Arthur L Horwich ... Jonathan S Weissman
01 May 1997
Cell | VOL. 89

The BlastNP: a novel, sensitive sequence similarity searching method using overlappingly translated sequences
J.C Biro ... J.M.K Biro
-
J.C Biro, et. al.J.C Biro ... J.M.K Biro
01 Jan 2004
01 Jan 2004

Comparative analysis of the prion protein ( PrP) gene in cetacean species
Pier Luigi Acutis ... Maria Caramelli
Gene | VOL. 392
Pier Luigi Acutis, et. al.Pier Luigi Acutis ... Maria Caramelli
12 Jan 2007
Gene | VOL. 392

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Overlapping translation of nucleic acid sequences for bioinformatics applications

Abstract

Talk to us

Similar Papers

More From: Medical Hypotheses