Gene Alert--a sequence search results keyword parser.

H. Huang, H. R. Garner

doi:10.1109/51.664040

Abstract

Similarity searching is an important tool for many biological scientists. Scientists use various computer implementations (BLAST, FASTA, Smith-Waterman) to analyze their sequences of interest and to identify identities (perfect matches) or similarities (statistically significant matches) between their query sequences and large databases such as GenBank. Search engines currently return brief annotations and alignments ranked in order of statistical significance or raw similarity score. However, it is frequently not the top-scoring similarities that bring important new information to the investigating scientist, but the content of the annotation or similarity hits at any significant score. The Gene Alert algorithm applies additional filtering and a user-weighted keyword search to the BLAST output, parsing it into a form customized to the user. The Gene Alert implementation, as it currently operates, has three components: an organized file structure, a BLAST engine, and a parser written in the Perl scripting language. The file structure was designed to place code and database components in logical positions and to facilitate future complete automation of the Gene Alert and similarity search system. Shown here is the file structure within the UNIX environment.
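
The filtering and user-weighted keyword search described in the abstract can be illustrated with a minimal sketch. This is not the authors' Perl parser. It is written in Python and assumes that each hit has already been reduced to an annotation string and a significance value (here called an E-value). The names `Hit`, `keyword_score`, `rank_hits`, and the cutoff `max_e_value` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """One similarity hit (hypothetical structure): annotation text plus significance."""
    annotation: str
    e_value: float


def keyword_score(annotation, weights):
    """Sum the weights of all user keywords found in the annotation (case-insensitive)."""
    text = annotation.lower()
    return sum(weight for keyword, weight in weights.items() if keyword.lower() in text)


def rank_hits(hits, weights, max_e_value=1e-3):
    """Filter hits by significance, then rank the keyword-matching ones by keyword score.

    The cutoff and the scoring rule are assumptions, not values from the paper.
    """
    # Step 1: keep any hit that is significant, not only the top-scoring ones.
    significant = [h for h in hits if h.e_value <= max_e_value]
    # Step 2: score each remaining annotation against the user's weighted keywords.
    scored = [(keyword_score(h.annotation, weights), h) for h in significant]
    # Step 3: drop hits with no keyword match and sort by descending keyword score.
    matched = [(score, h) for score, h in scored if score > 0]
    return sorted(matched, key=lambda pair: pair[0], reverse=True)
```

Under this scheme, a hit with a modest but significant score can rank above the statistically strongest hit if its annotation matches a keyword the user weighted heavily, which is the behavior the abstract motivates.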

Journal: IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society
Publication Date: Jan 1, 1998
Citations: 3
