Sequence search algorithm assessment and testing toolkit (SAT).

Jong Park,Cyrus Chothia,Liisa Holm

doi:10.1093/bioinformatics/16.2.104

Abstract

The Sequence Search Algorithm Assessment and Testing Toolkit (SAT) aims to be a complete package for the comparison of different protein homology search algorithms. The structural classification of proteins can provide us with a clear criterion for judgment in homology detection. There have been several assessments based on structural sequences with classifications but a good deal of similar work is now being repeated with locally developed procedures and programs. The SAT will provide developers with a complete package which will save time and produce more comparable performance assessments for search algorithms. The package is complete in the sense that it provides a non-redundant large sequence resource database, a well-characterized query database of proteins domains, all the parsers and some previous results from PSI-BLAST and a hidden markov model algorithm. An analysis on two different data sets was carried out using the SAT package. It compared the performance of a full protein sequence database (RSDB100) with a non-redundant representative sequence database derived from it (RSDB50). The performance measurement indicated that the full database is sub-optimal for a homology search. This result justifies the use of much smaller and faster RSDB50 than RSDB100 for the SAT. A web site is up. The whole packa ge is accessible via www and ftp. ftp://ftp.ebi.ac.uk/pub/contrib/jong/SAT http://cyrah.ebi.ac.uk:1111/Proj/Bio/SAT http://www.mrc-lmb.cam.ac.uk/genomes/SAT In the package, some previous assessment results produced by the package can also be found for reference. jong@ebi.ac.uk

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Sequence search algorithm assessment and testing toolkit (SAT).

Abstract

Talk to us

Similar Papers

More From: Bioinformatics (Oxford, England)

Lead the way for us

Journal: Bioinformatics (Oxford, England)	Publication Date: Feb 1, 2000
Citations: 24

Similar Papers

De novo transcriptome analysis and identification of genes associated with immunity, detoxification and energy metabolism from the fat body of the tephritid gall fly, Procecidochares utilis.
Lifang Li ... Mengyue Li
PloS one | VOL. 14
Lifang Li, et. al.Lifang Li ... Mengyue Li
17 Dec 2019
PloS one | VOL. 14

Removing near-neighbour redundancy from large protein sequence collections.
L Holm ... C Sander
Bioinformatics | VOL. 14
L Holm, et. al.L Holm ... C Sander
01 Jun 1998
Bioinformatics | VOL. 14

Type II restriction endonuclease R.Hpy188I belongs to the GIY-YIG nuclease superfamily, but exhibits an unusual active site
Katarzyna H Kaminska ... Ichizo Kobayashi
BMC Structural Biology | VOL. 8
Katarzyna H Kaminska, et. al.Katarzyna H Kaminska ... Ichizo Kobayashi
14 Nov 2008
BMC Structural Biology | VOL. 8

A Pitfall of Iterative Ψ-Blast
Natalya S Bogatyreva
Journal of Molecular Biomedical Research | VOL. 1
Natalya S BogatyrevaNatalya S Bogatyreva
01 Jan 2024
Journal of Molecular Biomedical Research | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sequence search algorithm assessment and testing toolkit (SAT).

Abstract

Talk to us

Similar Papers

More From: Bioinformatics (Oxford, England)