An Experimentally Derived Data Set Constructed for Testing Large-Scale DNA Sequence Assembly Algorithms

Donald Seto,Leroy Hood,Ben F Koop

doi:10.1006/geno.1993.1123

Abstract

A data set consisting of DNA sequences from a large-scale shotgun DNA cloning and sequencing project has been collected and posted for public release. The purpose is to propose a standard genomic DNA sequencing data set by which various algorithms and implementations can be tested. This set of data is divided into two subsets, one containing raw DNA sequence data (1023 clones) and the other consisting of the corresponding partially refined or edited DNA sequence data (820 clones). Suggested criteria or guidelines for this data refinement are presented so that algorithms for preprocessing and screening raw sequences may be developed. Development of such preprocessing, screening, aligning, and assembling algorithms will expedite large-scale DNA sequencing projects so that the complete unambiguous consensus DNA sequences will be made available to the general research community in a quicker manner. Smaller scale routine DNA sequencing projects will also be greatly aided by such computational efforts.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Experimentally Derived Data Set Constructed for Testing Large-Scale DNA Sequence Assembly Algorithms

Abstract

Talk to us

Similar Papers

More From: Genomics

Lead the way for us

Similar Papers

Partial CviJI digestion as an alternative approach to generate cosmid sublibraries for large-scale sequencing projects.
Jeffrey C Gingrich ... Subha B Basu
BioTechniques | VOL. 21
Jeffrey C Gingrich, et. al.Jeffrey C Gingrich ... Subha B Basu
01 Jul 1996
BioTechniques | VOL. 21

Human genetics special issue on computational molecular medicine.
Rachel Karchin ... Melissa S Cline
Human Genetics | VOL. 134
Rachel Karchin, et. al.Rachel Karchin ... Melissa S Cline
25 Mar 2015
Human Genetics | VOL. 134

Automated DNA Sequencing and Analysis
...
-
, et. al. ...
01 Jan 1993
01 Jan 1993

Statement on the rapid release of genomic DNA sequence.
Notes From The Meeting ... Statement Compiled By Mark Guyer
Genome research | VOL. 8
Notes From The Meeting, et. al.Notes From The Meeting ... Statement Compiled By Mark Guyer
01 May 1998
Genome research | VOL. 8

Journal: Genomics	Publication Date: Mar 1, 1993
Citations: 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Experimentally Derived Data Set Constructed for Testing Large-Scale DNA Sequence Assembly Algorithms

Abstract

Talk to us

Similar Papers

More From: Genomics